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(B) FILING DATb. ^ 
(viii) .TTORNKV/AGENT INFORMATION 
(A) NAME: Haile, Ph-D- 
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(B) TELEFAX: 619/t>'° 

(C) TELEX: 

(2) INFO RMATION FOR SEQ ID NO:l: 

, ■ , qEOUENCE CHARACTERISTICS : 
i seuuhin^ h = se pairs 

(A) LENGTH: 52 Case 
B TYPE: nucleic acid 
C STRANDEDNESS : Single 
(D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA 
(xi) SEQUENCE DES CRXPTXO N: SEO ID^l: 

™*&TTAAC TATGGGGGCA GGTCCGAAAA 

CCGAGAATTC attaaagagg agaaattaac 

(2 ) INFORMATION FOR SEQ ID NO 

(i) SEQUENCE CHARACTERISTICS : 
1 (A) LENGTH: 31 base pairs 



52 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

CCGAGGATCC TCACCGCCCC CTGCGGGTGC G 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
{D} TOPOLOGY : linear 

(ii) MOLECULE TYPE : CDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGTTGGAT ATACTGCTTG TT 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CCGACGATCC TTATTTTTTA ACCAAATGTT CC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGATGATG GAATTCACTC GC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGGAGGATCC CTACAGTTCT AAAAGTCTTT TA 



(2) INFORMATION FOR SEQ ID NO;7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGAGAACC CTGACAATAA AC 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 
CCGAGGATCC TTACACCCAC AGAACCCTTA C 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAAAGGA AAGTCTCTTG TT 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CCGAGGATCC TCAAGCTTCC TGGAGAATCA A 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCAAGA AATATCGCCG CT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
CCGAGGATCC TTAAGGCTTC TCGAGGTGGG GGTT 

{2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGTATAAA TGGATTATTG AGGG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 14 : 

CCGAGG ACTA AACATAGTCT AAGTAATTAG C 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAATC CTCCTCACCA AC 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 



# 



(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCGAGGATCC TCACAGGCTC AGAAGCCTTT G 31 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGAAAAC TTAAAAAAGT ACCT 54 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 
CCGAGGATCC TCACCGCCCC CTGCGGGTGC G 31 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1. . .780 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG AGG GGG AGC GGA GTG CGG ATA CTT CTC ACC AAC GAT GAC GGC ATC 4 8 

Met Arg Gly Ser Gly Val Arg lie Leu Leu Thr Asn Asp Asp Gly He 
15 10 15 

TTT GCC GAG GGT CTG GGG GCT CTG CGC AAG ATG CTG GAG CCC GTG GCT 96 
Phe Ala Glu Gly Leu Gly Ala Leu Arg Lys Met Leu Glu Pro Val Ala 

20 25 30 

ACC CTT TAC GTG GTG GCT CCG GAC CGA GAG CGT AGC GCG GCC AGC CAT 14 4 

Thr Leu Tyr Val Val Ala Pro Asp Arg Glu Arg Ser Ala Ala Ser His 



35 40 45 

GCT ATC ACC GTT CAC CGC CCC CTG CGG GTG CGG GAG GCG GGT TTT CGC 192 
Ala lie Thr Val His Arg Pro Leu Arg Val Arg Glu Ala Gly Phe Arg 
50 55 60 

AGC CCC AGG CTT AAA GGC TGG GTA GTG GAC GGT ACC CCG GCC GAC TGC 240 
Ser Pro Arg Leu Lys Gly Trp Val Val Asp Gly Thr Pro Ala Asp Cys 
65 70 75 80 

GTC AAG CTG GGC CTG GAG GTA CTT TTG CCC GAA CGT CCA GAT TTC CTG 288 
Val Lys Leu Gly Leu Glu Val Leu Leu Pro Glu Arg Pro Asp Phe Leu 

85 90 95 

GTT TCG GGC ATA AAC TAC GGG CCC AAC CTG GGT ACC GAC GTA CTT TAC 33 6 

Val Ser Gly He Asn Tyr Gly Pro Asn Leu Gly Thr Asp Val Leu Tyr 

100 105 110 

TCC GGC ACC GTC TCG GCG GCC ATA GAA GGG GTA ATT AAC GGC ATT CCC 3 84 

Ser Gly Thr Val Ser Ala Ala He Glu Gly Val He Asn Gly He Pro 
115 120 125 

TCG GTG GCC GTA TCT TTG GCC ACG CGG CGG GAG CCG GAC TAT ACC TGG 43 2 

Ser Val Ala Val Ser Leu Ala Thr Arg Arg Glu Pro Asp Tyr Thr Trp 
130 135 140 

GCG GCC CGG TTC GTC CTG GTC CTG CTG GAG GAA CTG CGA AAA CAC CAA 4 80 

Ala Ala Arg Phe Val Leu Val Leu Leu Glu Glu Leu Arg Lys His Gin 
145 150 155 160 

CTG CCC CCA GGA ACC CTG CTC AAC GTC AAC GTG CCC GAC GGG GTG CCC 528 
Leu Pro Pro Gly Thr Leu Leu Asn Val Asn Val Pro Asp Gly Val Pro 

165 170 175 

CGC GGG GTC AAG GTG ACC AAA CTG GGA AGC GTA CGC TAC GTC AAC GTG 576 
Arg Gly Val Lys Val Thr Lys Leu Gly Ser Val Arg Tyr Val Asn Val 

180 185 190 

GTA GAC TGC CGC ACC GAC CCT CGG GGG AAG GCT TAC TAC TGG ATG GCG 62 4 

Val Asp Cys Arg Thr Asp Pro Arg Gly Lys Ala Tyr Tyr Trp Met Ala 
195 200 205 

GGA GAA CCA TTG GAG CTG GAC GGC AAC GAC TCC GAA ACC GAC GTC TGG 672 
Gly Glu Pro Leu Glu Leu Asp Gly Asn Asp Ser Glu Thr Asp Val Trp 
210 215 220 

GCG GTG CGA GAA GGC TAT ATT TCC GTA ACA CCG GTC CAG ATC GAC CTT 720 
Ala Val Arg Glu Gly Tyr He Ser Val Thr Pro Val Gin He Asp Leu 
225 230 235 240 

ACT AAC TAC GGC TTC CTG GAA GAA CTC AAA AAA TGG CGT TTC AAG GAT 768 
Thr Asn Tyr Gly Phe Leu Glu Glu Leu Lys Lys Trp Arg Phe Lys Asp 

245 250 255 

ATC TTT TCT TCT TAA 7 83 

He Phe Ser Ser 

260 

(2) INFORMATION FOR SEQ ID NO : 2 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



I 



(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1 . . . 795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG TTG GAT ATA CTG CTT GTT AAT GAT GAT GGC ATT TAT TCA AAT GGA 4 8 

Met Leu Asp lie Leu Leu Val Asn Asp Asp Gly He Tyr Ser Asn Gly 
15 10 15 

TTA ATA GCT TTG AAG GAT GCA TTA TTG GAA AAA TTT AAT GCG AGG ATT 96 
Leu He Ala Leu Lys Asp Ala Leu Leu Glu Lys Phe Asn Ala Arg He 

20 25 30 

ACT ATT GTA GCC CCA ACA AAT CAG CAG AGT GGT ATT GGT AGG GCA ATA 144 
Thr He Val Ala Pro Thr Asn Gin Gin Ser Gly He Gly Arg Ala He 
35 40 45 

AGT TTA TTC GAG CCG TTA AGG ATA ACT AAA ACC AAA TTA GCA GAT GGT 192 
Ser Leu Phe Glu Pro Leu Arg He Thr Lys Thr Lys Leu Ala Asp Gly 
50 55 60 

TCT TGG GGA TAT GCA GTT TCA GGA ACC CCA ACA GAT TGC GTT ATA TTG 24 0 

Ser Trp Gly Tyr Ala Val Ser Gly Thr Pro Thr Asp Cys Val He Leu 
65 70 75 80 

GGC ATT TAT GAG ATA TTA AAG AAG GTA CCT GAT GTA GTT ATA TCA GGA 2 88 

Gly He Tyr Glu He Leu Lys Lys Val Pro Asp Val Val lie Ser Gly 

85 90 95 

ATA AAC ATT GGA GAA AAC CTT GGG ACT GAA ATA ACA ACT TCT GGA ACG 336 
He Asn He Gly Glu Asn Leu Gly Thr Glu He Thr Thr Ser Gly Thr 

100 105 HO 

TTG GGG GCT GCG TTT GAA GGG GCC CAT CAT GGG GCT AAG GCA TTA GCA 3 84 

Leu Gly Ala Ala Phe Glu Gly Ala His His Gly Ala Lys Ala Leu Ala 
115 120 125 

TCA TCA CTC CAA GTT ACC TCT GAC CAT CTA AAG TTT AAA GAG GGG GAG 4 32 

Ser Ser Leu Gin Val Thr Ser Asp His Leu Lys Phe Lys Glu Gly Glu 
130 135 140 

ACC CCA ATA GAC TTC ACA GTC CCA GCA AGA ATT ACT GCA AAT GTT GTT 480 
Thr Pro He Asp Phe Thr Val Pro Ala Arg He Thr Ala Asn Val Val 
145 150 155 160 

GAG AAG ATG TTG GAT TAT GAT TTC CCA TGT GAT GTC GTC AAC TTA AAC 52 8 

Glu Lys Met Leu Asp Tyr Asp Phe Pro Cys Asp Val Val Asn Leu Asn 

165 170 175 

ATT CCA GAA GGA GCA ACA GAA AAG ACA CCG ATT GAA ATC ACA AGG TTG 57 6 

He Pro Glu Gly Ala Thr Glu Lys Thr Pro lie Glu lie Thr Arg Leu 

180 185 190 

GCA AGG AAA ATG TAT ACA ACA CAC GTT GAG GAA AGA ATA GAT CCA AGA 62 4 

Ala Arg Lys Met Tyr Thr Thr His Val Glu Glu Arg He Asp Pro Arg 
195 200 205 



GGG AGG AGT TAT TAT TGG ATT GAT GGG TAT CCT ATT TTA GAG GAA GAG 
Gly Arg Ser Tyr Tyr Trp He Asp Gly Tyr Pro He Leu Glu Glu Glu 
210 215 220 



672 



GAA GAC ACT GAT GTC TAT GTT GTT AGA AGA AAG GGA CAT ATT TCT CTA 72 0 

Glu Asp Thr Asp Val Tyr Val Val Arg Arg Lys Gly His lie Ser Leu 
225 230 235 240 

ACC CCA TTA ACA TTA GAC ACA AC A ATT AAA AAT TTA GAG GAA TTT AAG 76 8 

Thr Pro Leu Thr Leu Asp Thr Thr lie Lys Asn Leu Glu Glu Phe Lys 

245 250 255 

AAA AAA TAT GAG AGA ATA TTA AAT GAA TGA 7 98 

Lys Lys Tyr Glu Arg lie Leu Asn Glu 

260 265 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 . . . 762 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG ATG ATG GAA TTC ACT CGC GAG GGA ATA AAA GCT GCT GTA GAG GCA 4 8 

Met Met Met Glu Phe Thr Arg Glu Gly lie Lys Ala Ala Val Glu Ala 
15 10 15 

CTT CAA GGG TTA GGA GAG ATC TAC GTA GTT GCC CCA ATG TTT CAA AGG 96 
Leu Gin Gly Leu Gly Glu lie Tyr Val Val Ala Pro Met Phe Gin Arg 

20 25 30 

AGC GCA AGT GGA AGG GCA ATG ACC ATC CAC AGA CCT CTA AGG GCT AAA 144 
Ser Ala Ser Gly Arg Ala Met Thr lie His Arg Pro Leu Arg Ala Lys 
35 40 45 

AGA ATA AGT ATG AAC GGT GCA AAA GCA GCC TAT GCT TTG GAT GGA ATG 192 
Arg He Ser Met Asn Gly Ala Lys Ala Ala Tyr Ala Leu Asp Gly Met 
50 55 60 

CCC GTT GAT TGC GTT ATC TTT GCC ATG GCC AGA TTT GGA GAT TTC GAC 24 0 

Pro Val Asp Cys Val lie Phe Ala Met Ala Arg Phe Gly Asp Phe Asp 
65 70 75 80 

CTT GCA ATA AGT GGT GTA AAC TTG GGA GAA AAC ATG AGC ACC GAG ATA 2 88 

Leu Ala lie Ser Gly Val Asn Leu Gly Glu Asn Met Ser Thr Glu He 

85 90 95 

ACG GTT TCC GGG ACT GCA AGC GCT GCA ATA GAG GCT GCA ACC CAA GAG 3 36 

Thr Val Ser Gly Thr Ala Ser Ala Ala He Glu Ala Ala Thr Gin Glu 

100 105 HO 

ATC CCA AGC ATT CCC ATA AGC CTG GAA GTT AAT AGA GAA AAA CAC AAA 3 84 

He Pro Ser He Pro He Ser Leu Glu Val Asn Arg Glu Lys His Lys 
115 120 125 

TTT GGT GAG GGC GAA GAG ATT GAC TTC TCA GCT GCC AAG TAT TTC CTA 43 2 

Phe Gly Glu Gly Glu Glu lie Asp Phe Ser Ala Ala Lys Tyr Phe Leu 
130 135 140 



AGA AAA ATC GCA ACG GCG GTT TTA AAG AGA GGC CTC CCC AAA GGA GTC 



480 
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„ g t j, II. Al, Thr Al» V.1 L,u Lys Arg My L.u Pro Ly. «y V.l 
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,» CTT « A=A r CTT ACC □» ACT CCA ATA AAC ATT CAT AT. ACA 

Lys val Glu Arg Lys Val Ser vai inr riu ^ 24Q 

Z AGA GTG AAT TTA GAC GAG ATT AAA AGA CTT TTA GAA CTG 

Ala Arg Val Asn Leu Asp Glu lie Lys ary 

245 ^ u 



TAG 



528 



576 



624 



672 



720 



762 



765 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

{A) NAME/ KEY : Coding Sequence 
(B) LOCATION: 1. - .813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

K2 £ S ™ £ S ^ £ Z S SS % S SI ^ S 
S* 25 S ?S £ Asp 25 SJ S| ?55 S - S ~ S K 
S !5 S S S i2 SS S S Si SI S !5 !S S S 

- S £ Arg E S £ i £ S S? tS ss s s s 

Si £ S 5! S !S £ SI S ffi K ili K - S 

65 70 



48 



96 



144 



192 



240 



ATA GAT GGA ACT CCT ACT GAC TGC GTG ATT TTC GCC ATA GCC CGC TTC 
He Asp Gly Thr Pro Thr Asp Cys Val He Phe Ala He Ala Arg Phe 

85 90 95 

GGG AGC TTT GGT TTA GCC GTG AGC GGG ATT AAC CTC GGC GAG AAC CTG 
Gly Ser Phe Gly Leu Ala Val Ser Gly He Asn Leu Gly Glu Asn Leu 

100 105 HO 

AGC ACC GAG ATA ACA GTC TCA GGG ACG GCC TCC GCT GCC ATA GAG GCC 
Ser Thr Glu He Thr Val Ser Gly Thr Ala Ser Ala Ala He Glu Ala 
115 120 125 

TCA ACT CAT GGA ATT CCG AGC ATA GCG ATT AGC CTT GAG GTG GAG TGG 
Ser Thr His Gly He Pro Ser He Ala He Ser Leu Glu Val Glu Trp 
130 135 140 

AAG AAG ACC CTC GGC GAG GGT GAG GGG GTT GAC TTC TCG GTC TCG ACT 
Lys Lys Thr Leu Gly Glu Gly Glu Gly Val Asp Phe Ser Val Ser Thr 
145 150 155 160 

CAC TTC CTC AAG AGA ATC GCG GGA GCC CTC TTG GAG AGA GGT CTT CCT 
His Phe Leu Lys Arg He Ala Gly Ala Leu Leu Glu Arg Gly Leu Pro 

165 170 175 

GAG GGC GTT GAC ATG CTC AAC GTC AAC GTT CCG AGC GAC GCG ACG GAG 
Glu Gly Val Asp Met Leu Asn Val Asn Val Pro Ser Asp Ala Thr Glu 

180 185 190 

GAA ACG GAG ATA GCA ATC ACC CGC TTA GCC CGG AAG CGC TAC TCC CCA 
Glu Thr Glu He Ala He Thr Arg Leu Ala Arg Lys Arg Tyr Ser Pro 
195 200 205 

ACG GTC GAG GAG AGG ATT GAC CCC AAG GGC AAC CCC TAC TAC TGG ATT 
Thr Val Glu Glu Arg lie Asp Pro Lys Gly Asn Pro Tyr Tyr Trp lie 
210 215 220 

GTC GGC AAA CTT GTC CAA GAC TTC GAG CCA GGG ACA GAT GCC TAC GCC 
Val Gly Lys Leu Val Gin Asp Phe Glu Pro Gly Thr Asp Ala Tyr Ala 
225 230 235 240 

CTG AAG GTC GAG AGG AAG GTC AGC GTC ACG CCG ATA AAC ATA GAT ATG 
Leu Lys Val Glu Arg Lys Val Ser Val Thr Pro lie Asn lie Asp Met 

245 250 255 

ACT GCG AGG GTG GAC TTT GAG GAG CTT GTA AGG GTT CTG TGG GTG 
Thr Ala Arg Val Asp Phe Glu Glu Leu Val Arg Val Leu Trp Val 

260 265 270 

TAA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1, , .1491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



ATG AAA GGA AAG TCT CTT GTT AGC GGT CTG TTG TTG GGT CTT TTA ATT 
Met Lys Gly Lys Ser Leu Val Ser Gly Leu Leu Leu Gly Leu Leu lie 
15 10 15 

TTG AGC CTG ATT TCA TTC CAG CCA AGC TTT GCA TAC TCC CCA CAC GGC 
Leu Ser Leu He Ser Phe Gin Pro Ser Phe Ala Tyr Ser Pro His Gly 

20 25 30 

GGT GTC AAA AAC ATC ATA ATC CTG GTT GGA GAC GGC ATG GGT CTT GGG 
Gly Val Lys Asn He He He Leu Val Gly Asp Gly Met Gly Leu Gly 
35 40 45 

CAT GTA GAA ATT ACA AAG CTC GTT TAT GGA CAC TTA AAC ATG GAA AAC 
His Val Glu He Thr Lys Leu Val Tyr Gly His Leu Asn Met Glu Asn 
50 55 60 

TTT CCA GTT ACT GGA TTT GAG CTT ACT GAT TCC CTA AGT GGT GAA GTT 
Phe Pro Val Thr Gly Phe Glu Leu Thr Asp Ser Leu Ser Gly Glu Val 
65 70 75 80 

ACA GAT TCT GCT GCG GCA GGA ACT GCA ATA TCC ACT GGA GCT AAA ACG 
Thr Asp Ser Ala Ala Ala Gly Thr Ala He Ser Thr Gly Ala Lys Thr 

85 90 95 

TAT AAT GGT ATG ATT TCA GTA ACC AAC ATA ACC GGA AAG ATA GTT AAC 
Tyr Asn Gly Met He Ser Val Thr Asn He Thr Gly Lys He Val Asn 

100 105 110 

TTA ACA ACC CTA CTT GAA GTG GCT CAA GAG CTT GGG AAG TCA ACA GGG 
Leu Thr Thr Leu Leu Glu Val Ala Gin Glu Leu Gly Lys Ser Thr Gly 
115 120 125 

CTG GTC ACC ACA ACA AGG ATT ACC CAT GCA ACT CCA GCA GTT TTT GCG 
Leu Val Thr Thr Thr Arg He Thr His Ala Thr Pro Ala Val Phe Ala 
130 135 140 

TCC CAT GTC CCA GAT AGG GAT ATG GAG GGG GAG ATA CCC AAG CAA CTC 
Ser His Val Pro Asp Arg Asp Met Glu Gly Glu He Pro Lys Gin Leu 
145 150 155 160 

ATA ATG CAC AAA GTT AAC GTC TTG TTG GGT GGT GGA AGG GAG AAA TTC 
He Met His Lys Val Asn Val Leu Leu Gly Gly Gly Arg Glu Lys Phe 

165 170 175 

GAT GAG AAA AAT TTG GAG CTG GCC AAA AAG CAG GGA TAC AAA GTA GTT 
Asp Glu Lys Asn Leu Glu Leu Ala Lys Lys Gin Gly Tyr Lys Val Val 

180 185 190 

TTC ACG AAG GAA GAG CTT GAA AAA GTT GAA GGA GAT TAT GTC CTA GGA 
Phe Thr Lys Glu Glu Leu Glu Lys Val Glu Gly Asp Tyr Val Leu Gly 
195 200 205 

CTC TTT GCA GAA AGT CAC ATC CCT TAC GTA TTG GAT AGA AAA CCC GAT 
Leu Phe Ala Glu Ser His lie Pro Tyr Val Leu Asp Arg Lys Pro Asp 
210 215 220 

GAT GTT GGA CTT TTA GAA ATG GCC AAA AAG GCA ATT TCA ATA CTC GAG 
Asp Val Gly Leu Leu Glu Met Ala Lys Lys Ala He Ser He Leu Glu 
225 230 235 240 



AAG AAC CCG AGC GGA TTC TTT CTC ATG GTT GAG GGC GGA AGG ATT GAC 
Lys Asn Pro Ser Gly Phe Phe Leu Met Val Glu Gly Gly Arg lie Asp 



245 250 255 

CAT GCA GCC CAT GGA AAC GAT GTC GCA TCG GTT GTT GCA GAA ACT AAG 
His Ala Ala His Gly Asn Asp Val Ala Ser Val Val Ala Glu Thr Lys 

260 265 270 

GAG TTT GAC GAT GTT GTC AGA TAC GTG CTG GAA TAT CCG AAG AAG AGG 
Glu Phe Asp Asp Val Val Arg Tyr Val Leu Glu Tyr Pro Lys Lys Arg 
275 280 285 

GGA GAT ACC TTG GTA ATA GTG CTT GCC GAT CAC GAA ACT GGA GGT CTT 
Gly Asp Thr Leu Val lie Val Leu Ala Asp His Glu Thr Gly Gly Leu 
290 295 300 

GCA ATA GGT CTA ACG TAT GGA AAT GCA ATC GAT GAA GAT GCC ATA AGA 
Ala He Gly Leu Thr Tyr Gly Asn Ala He Asp Glu Asp Ala He Arg 
305 310 315 320 

AAA ATA AAA GCA AGC ACG TTG AGG ATG CCC AAA GAG GTT AAG GCA GGG 
Lys He Lys Ala Ser Thr Leu Arg Met Pro Lys Glu Val Lys Ala Gly 

325 330 335 

AGT AGT GTA AAA GAG TCC TCA AAG GTA TGC CGG ATT TGT CCC AAC AGA 
Ser Ser Val Lys Glu Ser Ser Lys Val Cys Arg He Cys Pro Asn Arg 

340 345 350 

GGA AGA AGT CAG TAT ATT GAG AAT GCG CTG CAC TCG ACA AAC AAG TAT 
Gly Arg Ser Gin Tyr lie Glu Asn Ala Leu His Ser Thr Asn Lys Tyr 
355 360 365 

GCC CTC TCA AAT GCA GTA GCC GAT GTT ATA AAC AGG CGT ATT GGT GTT 
Ala Leu Ser Asn Ala Val Ala Asp Val He Asn Arg Arg He Gly Val 
370 375 380 

GGA TTC ACC TCC TAT GAG CAT ACA GGA GTT CCA GTT CCG CTC TTA GCT 
Gly Phe Thr Ser Tyr Glu His Thr Gly Val Pro Val Pro Leu Leu Ala 
385 390 395 400 

TAC GGT CCC GGG GCA GAG AAC TTC AGA GGT TTC TTA CAC CAT GTG GAT 
Tyr Gly Pro Gly Ala Glu Asn Phe Arg Gly Phe Leu His His Val Asp 

405 410 415 

ACA GCA AGA TTA GTT GCA AAG TTA ATG CTC TTT GGA AGG AGG AAT ATT 
Thr Ala Arg Leu Val Ala Lys Leu Met Leu Phe Gly Arg Arg Asn He 

420 425 430 

CCA GTT ACC ATT TCA AGC GTG AGC AGT GTT AAG GGA GAC ATA ACC GGT 
Pro Val Thr He Ser Ser Val Ser Ser Val Lys Gly Asp He Thr Gly 
435 440 445 

GAT TAC AGG GTT GAT GAG AAG GAT GCC TAC GTT ACG CTC ATG ATG TTT 
Asp Tyr Arg Val Asp Glu Lys Asp Ala Tyr Val Thr Leu Met Met Phe 
450 455 460 

CTC GGA GAA AAA GTG GAT AAT GAA ATT GAA AAG AGA GTC GAT ATA GAC 
Leu Gly Glu Lys Val Asp Asn Glu He Glu Lys Arg Val Asp He Asp 
465 470 475 480 

AAC AAC GGC ATG GTT GAC TTA AAT GAC GTC ATG TTG ATT CTC CAG GAA 
Asn Asn Gly Met Val Asp Leu Asn Asp Val Met Leu He Leu Gin Glu 

485 490 495 



GCT TGA 
Ala 



(2) INFORMATION FOR SEQ ID NO: 24: 



Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME / KEY : Coding Sequence 

(B) LOCATION: 1. . .1752 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:24; 



ATG CCA AGA AAT ATC GCC GCT GTA TGC GCC CTG GCC GCT TTG TTA GGG 
Met Pro Arg Asn He Ala Ala Val Cys Ala Leu Ala Ala Leu Leu Gly 
15 10 15 

TCG GCC TGG GCG GCC AAA GTT GCC GTC TAC CCC TAC GAC GGA GCC GCT 
Ser Ala Trp Ala Ala Lys Val Ala Val Tyr Pro Tyr Asp Gly Ala Ala 

20 25 30 

TTG CTG GCG GGG CAG CGC TTC GAT TTG CGC ATA GAA GCC TCC GAG CTG 
Leu Leu Ala Gly Gin Arg Phe Asp Leu Arg He Glu Ala Ser Glu Leu 
35 40 45 

AAA GGC AAT TTA AAG GCT TAC CGC ATC ACC CTG GAC GGC CAG CCT CTG 
Lys Gly Asn Leu Lys Ala Tyr Arg He Thr Leu Asp Gly Gin Pro Leu 
50 55 60 

GCG GGC CTC GAG CAA ACC GCG CAG GGG GCC GGG CAG GCC GAG TGG ACC 
Ala Gly Leu Glu Gin Thr Ala Gin Gly Ala Gly Gin Ala Glu Trp Thr 
65 70 75 80 

CTG CGC GGT GCC TTC CTG CGC CCT GGA AGC CAC ACC CTC GAG GTC AGC 
Leu Arg Gly Ala Phe Leu Arg Pro Gly Ser His Thr Leu Glu Val Ser 

85 90 95 

CTC ACC GAC GAC GCT GGG GAG AGC AGG AAG AGC GTA CGT TGG GAG GCT 
Leu Thr Asp Asp Ala Gly Glu Ser Arg Lys Ser Val Arg Trp Glu Ala 

100 105 HO 



CGG CAG AAC CTT CGC TTG CCC CGA GCG GCC AAG AAT GTG ATT CTC TTC 
Arg Gin Asn Leu Arg Leu Pro Arg Ala Ala Lys Asn Val He Leu Phe 
115 120 125 

ATT GGC GAC GGG ATG GGC TGG AAC ACC CTC AAC GCC GCC CGC ATC ATC 
He Gly Asp Gly Met Gly Trp Asn Thr Leu Asn Ala Ala Arg lie lie 
130 135 140 

GCC AAA GGC TTT AAC CCC GAA AAC GGT ATG CCC AAC GGA AAC CTC GAG 
Ala Lys Gly Phe Asn Pro Glu Asn Gly Met Pro Asn Gly Asn Leu Glu 
145 150 155 160 

ATC GAG AGT GGT TAC GGT GGG ATG GCT ACC GTC ACT ACC GGC AGC TTT 
He Glu Ser Gly Tyr Gly Gly Met Ala Thr Val Thr Thr Gly Ser Phe 

165 170 175 

GAT AGC TTC ATC GCC GAC TCA GCT AAC TCG GCT TCT TCC ATC ATG ACC 
Asp Ser Phe He Ala Asp Ser Ala Asn Ser Ala Ser Ser He Met Thr 

180 185 190 



GGG CAG AAG 
Gly Gin Lys 
195 

AAA GAT ACC 
Lys Asp Thr 
210 

AAG CGG GTA 
Lys Arg Val 
225 

ACC GAC GCT 
Thr Asp Ala 



GAT TAC CAG 
Asp Tyr Gin 



GTT CCC TTG 
Val Pro Leu 
275 

CAG AGC ACC 
Gin Ser Thr 
290 

GAA TCC CAG 
Glu Ser Gin 
305 

CTG GCG GCC 
Leu Ala Ala 



AAC TTC CCC 
Asn Phe Pro 



CTG GGA AGC 
Leu Gly Ser 
355 

GCC GTG GAG 
Ala Val Glu 
370 

GAG GGG GGA 
Glu Gly Gly 
385 

GCA CTT TGG 
Ala Leu Trp 



GGC TAT GCG 
Gly Tyr Ala 



CAC GCT CAC 
His Ala His 
435 

GGC CGG GAG 



GTG CAG GTG 
Val Gin Val 



CTG GCC TAC 
Leu Ala Tyr 



CGC GGG GCC 
Arg Gly Ala 
230 

ACC CCG GCT 
Thr Pro Ala 
245 

GCT ATC GCC 
Ala lie Ala 
260 

GAT GTG ATG 
Asp Val Met 



CCT GGC TCG 
Pro Gly Ser 



AAG CTG GGC 
Lys Leu Gly 
310 

AAA CCC ACC 
Lys Pro Thr 
325 

AGC TAC CTA 
Ser Tyr Leu 
340 

TTT ACC GAT 
Phe Thr Asp 



GCT CTC TCC 
Ala Leu Ser 



ATG GTG GAT 
Met Val Asp 
390 

GAT GTA CTC 
Asp Val Leu 
405 

GCC TCC CAC 
Ala Ser His 
420 

TCG ATC TCG 
Ser lie Ser 



GGG GTG GGG 



AAT GCC CTC 
Asn Ala Leu 
200 

CCC CGG ATC 
Pro Arg lie 
215 

AGC ATT GGG 
Ser He Gly 



TCA CTC AAC 
Ser Leu Asn 



GAC ATG TAC 
Asp Met Tyr 
265 

CTC TTC GGT 
Leu Phe Gly 
280 

CGG CGC AAG 
Arg Arg Lys 
295 

TAC ACC TTT 
Tyr Thr Phe 



GAT AAG CTG 
Asp Lys Leu 



GAC CGC GCA 
Asp Arg Ala 
345 

ATG CCC TAC 
Met Pro Tyr 
360 

AGA AAC GAC 
Arg Asn Asp 
375 

AAG TAC GAG 
Lys Tyr Glu 



GAG CTG GAC 
Glu Leu Asp 



CCC GAT ACC 
Pro Asp Thr 
425 

GTG TTT GGC 
Val Phe Gly 
440 

GTT TAT GAG 



AAC GTT TAC 
Asn Val Tyr 



GAA ACC CTA 
Glu Thr Leu 
220 

GTA GTG ACC 
Val Val Thr 
235 

GCC CAT ACC 
Ala His Thr 
250 

TTT GGT AGA 
Phe Gly Arg 



GGT TCA CGC 
Gly Ser Arg 



GAT AGC ACG 
Asp Ser Thr 
300 

GTC AGC ACC 
Val Ser Thr 
315 

TTT GGG CTG 
Phe Gly Leu 
330 

GTG TGG AAG 
Val Trp Lys 



CTC TGG GAG 
Leu Trp Glu 



AAA GGC TTT 
Lys Gly Phe 
380 

CAC CCC TTG 
His Pro Leu 
395 

CGC GCG GTG 
Arg Ala Val 
410 

CTG GTG ATT 
Leu Val He 



GGT TAC GAC 
Gly Tyr Asp 



GCC GCC AAG 



CCA TCA AAC 
Pro Ser Asn 
205 

GCG GAG ATG 
Ala Glu Met 



ACC ACC TTC 
Thr Thr Phe 



CGC CGC CGC 
Arg Arg Arg 
255 

GGC GGG TTC 
Gly Gly Phe 
270 

GAC TTC ATC 
Asp Phe He 
285 

GAC TGG ATT 
Asp Trp He 



CGC AGC GAG 
Arg Ser Glu 



TTC AAC ATT 
Phe Asn lie 
335 

CGG CCC GAG 
Arg Pro Glu 
350 

ATG ACC CAG 
Met Thr Gin 
365 

TTC TTG ATG 
Phe Leu Met 



GAC TGG CCC 
Asp Trp Pro 



GCT TGG GCC 
Ala Trp Ala 
415 

GTC ACC GCC 
Val Thr Ala 
430 

TAC TCC AAG 
Tyr Ser Lys 
445 

TTC CCC ACC 



CTC 624 
Leu 



CTC 672 
Leu 



GGC 72 0 

Gly 

240 

GGT 768 
Gly 



GGT 816 
Gly 



CCC 864 
Pro 



GCC 912 
Ala 



CTG 960 

Leu 

320 

GAC 1008 
Asp 



ATG 105 6 
Met 



AAA 1104 
Lys 



GTT 1152 
Val 



CGC 120 0 

Arg 

400 

AAG 1248 
Lys 



GAC 12 96 
Asp 



CAG 1344 
Gin 



TAC 1392 



Gly Arg Glu Gly Val Gly Val Tyr Glu Ala Ala Lys Phe Pro Thr Tyr 
450 455 460 



GGC GAC AAA AAA GAC GCC AAC GGC TTT CCC TTG CCC GAC ACC ACT CGG 
Gly Asp Lys Lys Asp Ala Asn Gly Phe Pro Leu Pro Asp Thr Thr Arg 
465 470 475 480 



GGA 
Gly 



ATC 
lie 



GCG 
Ala 



GTA 
Val 



GGC 
Gly 
485 



TTC 
Phe 



GGG 
Gly 



GCC ACG CCG GAT TAC TGT GAA 
Ala Thr Pro Asp Tyr Cys Glu 
490 



ACC TAC 
Thr Tyr 
495 



CGG GGC CGC 
Arg Gly Arg 



GAG 
Glu 
500 



GTC TAC AAA GAC 
Val Tyr Lys Asp 



CCC 
Pro 
505 



ACC 
Thr 



ATC 
lie 



TCC 
Ser 



GAC 
Asp 



GGC 
Gly 
510 



AAA 
Lys 



GGT 
Gly 



GGT TAC GTG GCC AAC CCT GAG GTC TGC AAG GAG CCG GGC CTT CCA ACG 
Gly Tyr Val Ala Asn Pro Glu Val Cys Lys Glu Pro Gly Leu Pro Thr 
515 520 525 

TAC CGG CAA CTC CCA GTA GAT AGC GCC CAG GGC GTG CAC ACG GCT GAT 
Tyr Arg Gin Leu Pro Val Asp Ser Ala Gin Gly Val His Thr Ala Asp 
530 535 540 

CCC ATG CCG CTG TTT GCC TTT GGC GTG GGG TCT CAG TTC TTC AAT GGC 
Pro Met Pro Leu Phe Ala Phe Gly Val Gly Ser Gin Phe Phe Asn Gly 
545 550 555 560 

CTC ATC GAC CAG ACC GAG ATC TTC TTC CGC ATG GCC CAG GCC CTA GGG 
Leu lie Asp Gin Thr Glu He Phe Phe Arg Met Ala Gin Ala Leu Gly 

565 570 575 



TTC AAC CCC CAC CTC GAG AAG CCT TAA 
Phe Asn Pro His Leu Glu Lys Pro 

580 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME / KEY : Coding Sequence 

(B) LOCATION: 1. . ,903 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 25: 



ATG TAT 
Met Tyr 
1 



AGC 
Ser 



CTA 
Leu 



AAA 
Lys 

GGT 
Gly 



TGG 
Trp 



GAA 
Glu 
20 



ATT 
He 
5 

CTA 
Leu 



ATT GAG GGT AAG 
He Glu Gly Lys 



GCC 
Ala 



GAT 
Asp 



CTC 
Leu 



AAA 
Lys 
25 



CTT 
Leu 
10 

AGA 
Arg 



GCC 
Ala 



CTT 
Leu 



CAA 
Gin 



TTC 
Phe 



GCA 
Ala 



GAC 
Asp 



CCT 
Pro 



GCC 
Ala 
30 



TTT CCA 
Phe Pro 
15 



ATT 
He 



ATT 
He 



GTT CTT ACA ATG CCG CAT GAA CAA CCG CTT AAT GAG AAA TAT ATC GAG 
Val Leu Thr Met Pro His Glu Gin Pro Leu Asn Glu Lys Tyr lie Glu 
35 40 45 



ATA TTA GAG AGC CAT GGA TTC CAA GTC CTC CAT GTC CCC ACG CTC GAC 
He Leu Glu Ser His Gly Phe Gin Val Leu His Val Pro Thr Leu Asp 



50 55 60 

TTT CAT CCT TTA GAA CTC TTC GAC CTT TTG AAA ACA AGC ATA TTC ATT 24 0 

Phe His Pro Leu Glu Leu Phe Asp Leu Leu Lys Thr Ser lie Phe lie 
65 70 75 80 

GAT GAA AAC CTG GAG AGA TCC CAC AGA GTG CTT GTC CAC TGC ATG GGA 28 8 

Asp Glu Asn Leu Glu Arg Ser His Arg Val Leu Val His Cys Met Gly 

85 90 95 

GGC ATA GGC CGG AGC GGG CTT GTA ACT GCT GCG TAC TTA ATA TTC AAA 33 6 

Gly He Gly Arg Ser Gly Leu Val Thr Ala Ala Tyr Leu He Phe Lys 

100 105 HO 

GGT TAT GAT ATT TAC GAC GCG GTA AAG CAT GTG AGA ACG GTA GTG CCT 3 84 

Gly Tyr Asp He Tyr Asp Ala Val Lys His Val Arg Thr Val Val Pro 
115 120 125 

GGT GCT ATT GAA AAC AGA GGG CAA GCG TTA ATG CTT GAG AAC TAC TAT 4 32 

Gly Ala He Glu Asn Arg Gly Gin Ala Leu Met Leu Glu Asn Tyr Tyr 
130 135 140 

ACC CTG GTC AAA AGT TTC AAC AGA GAG TTG CTG AGA GAC TAC GGG AAG 48 0 

Thr Leu Val Lys Ser Phe Asn Arg Glu Leu Leu Arg Asp Tyr Gly Lys 
145 150 155 160 

AAA ATT TTC ACG CTC GGT GAC CCG AAG GCG GTT CTC CAC GCT TCT AAG 52 8 

Lys He Phe Thr Leu Gly Asp Pro Lys Ala Val Leu His Ala Ser Lys 

165 170 175 

ACG ACT CAG TTC ACG ATT GAA CTC TTA AGC AAC TTA CAC GTC AAC GAG 57 6 

Thr Thr Gin Phe Thr He Glu Leu Leu Ser Asn Leu His Val Asn Glu 

180 185 190 

GCG TTT TCA ATC AGT GCG ATG GCT CAA TCA CTG CTC CAC TTT CAC GAC 624 
Ala Phe Ser He Ser Ala Met Ala Gin Ser Leu Leu His Phe His Asp 
195 200 205 

GTA AAA GTC CGC TCT AAA CTG AAA GAA GTA TTC GAA AAC ATG GAA TTC 672 
Val Lys Val Arg Ser Lys Leu Lys Glu Val Phe Glu Asn Met Glu Phe 
210 215 220 

TCA TCC GCC TCA GAG GAG GTT CTG TCA TTT ATT CAC CTA CTC GAT TTC 720 
Ser Ser Ala Ser Glu Glu Val Leu Ser Phe He His Leu Leu Asp Phe 
225 230 235 240 

TAT CAG GAT GGC AGG GTT GTT TTA ACC ATT TAC GAT TAT CTC CCC GAT 768 
Tyr Gin Asp Gly Arg Val Val Leu Thr He Tyr Asp Tyr Leu Pro Asp 

245 250 255 

AGG GTG GAT TTG ATT TTA TTG TGT AAG TGG GGT TGT GAT AAA ATA GTT 816 
Arg Val Asp Leu He Leu Leu Cys Lys Trp Gly Cys Asp Lys He Val 

260 265 270 

GAA GTC TCG TCT TCA GCG AAG AAA ACC GTT GAG AAG CTT GTA GGA AGA 864 
Glu Val Ser Ser Ser Ala Lys Lys Thr Val Glu Lys Leu Val Gly Arg 
275 280 285 

AAG GTT TCC CTA TCC TGG GCT AAT TAC TTA GAC TAT GTT TAG 906 
Lys Val Ser Leu Ser Trp Ala Asn Tyr Leu Asp Tyr Val 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 1. . .771 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



ATG AGA ATC CTC CTC ACC AAC GAC GAC GGC ATC TAT TCC AAC GGT CTG 4 8 

Met Arg lie Leu Leu Thr Asn Asp Asp Gly lie Tyr Ser Asn Gly Leu 

15 10 15 

CGC GCG GCG GTG AAG GGC CTG AGC GAG CTC GGC GAG GTC TAC GTC GTC 96 

Arg Ala Ala Val Lys Gly Leu Ser Glu Leu Gly Glu Val Tyr Val Val 

20 25 30 



GCC CCG CTC TTC CAG AGG AGC GCG AGC GGT CGG GCG ATG ACC CTA CAC 
Ala Pro Leu Phe Gin Arg Ser Ala Ser Gly Arg Ala Met Thr Leu His 
35 40 45 



144 



AGG CCG ATA AGG GCA AAG AGG GTT GAC GTT CCC GGC GCG AAG ATA GCG 
Arg Pro lie Arg Ala Lys Arg Val Asp Val Pro Gly Ala Lys lie Ala 
50 55 60 



192 



TAT GGC ATA GAC GGA ACG CCG ACC GAC TGC GTG ATT TTT GCC ATC GCC 
Tyr Gly lie Asp Gly Thr Pro Thr Asp Cys Val lie Phe Ala lie Ala 
65 70 75 80 



240 



CGC TTC GGC GAC TTT GAT CTG GCG GTC AGC GGG ATA AAC CTA GGC GAG 
Arg Phe Gly Asp Phe Asp Leu Ala Val Ser Gly lie Asn Leu Gly Glu 

85 90 95 



288 



AAC CTG AGC ACG GAG ATA ACC GTC TCC GGA ACG GCC TCG GCG GCG ATA 
Asn Leu Ser Thr Glu He Thr Val Ser Gly Thr Ala Ser Ala Ala He 

100 105 HO 



336 



GAG GCT TCC ACC CAC GGG ATT CCA AGT GTA GCT ATA AGC CTC GAG GTC 
Glu Ala Ser Thr His Gly He Pro Ser Val Ala He Ser Leu Glu Val 
115 120 125 



384 



GAG TGG AAG 
Glu Trp Lys 
130 

TCA GCA CAC 
Ser Ala His 
145 

CTG CCT GAA 
Leu Pro Glu 



AGC GAG GGG 
Ser Glu Gly 



TCT CCG ACG 
Ser Pro Thr 
195 



AAG ACC CTC 
Lys Thr Leu 



TTC CTG AGA 
Phe Leu Arg 
150 

GGG GTG GAC 
Gly Val Asp 
165 

ACT GAG ATC 
Thr Glu He 
180 

ATA GAG GAG 
lie Glu Glu 



GGC GAG GGG 
Gly Glu Gly 
135 

AGG ATA GCG 
Arg He Ala 



ATG CTC AAC 
Met Leu Asn 



GCC ATA ACG 
Ala He Thr 
185 

AGG ATA GAC 
Arg He Asp 
200 



GAG GGT ATT 
Glu Gly He 
140 

ACG GCT GTC 
Thr Ala Val 
155 

GTG AAC GTC 
Val Asn Val 
170 

CGC CTC GCG 
Arg Leu Ala 



CCC AAG GGC 
Pro Lys Gly 



GAC TTC TCG 
Asp Phe Ser 



CTT AAG AAG 
Leu Lys Lys 



CCT AGC GAC 
Pro Ser Asp 
175 

AGG AAG CGC 
Arg Lys Arg 
190 

AAC CCC TAC 
Asn Pro Tyr 
205 



GTT 432 
Val 



GGC 480 

Gly 

160 

GCC 52 8 

Ala 



TAT 576 
Tyr 



TAC 624 
Tyr 



TGG ATC 
Trp lie 
210 

TAC GCT 
Tyr Ala 
225 



GAC 
Asp 



ATG 
Met 



GTT 
Val 



CTG 
Leu 



ACT 
Thr 



GGC 
Gly 



AAA 
Lys 



GCG 
Ala 



AGG 
Arg 



GTC 
Val 



AGG 
Arg 
245 



CTC 
Leu 



GAG 
Glu 
230 

GTT 
Val 



GTC 
Val 
215 



CAG 
Gin 



AGA AAG 
Arg Lys 



GAC 
Asp 



TTT 
Phe 



GAG 
Glu 



GTC 
Val 



GAG 
Glu 



TTC 
Phe 



AGC 
Ser 



AAC 
Asn 
250 



GAG 
Glu 



GTC 
Val 
235 

CTT 
Leu 



CCG 
Pro 
220 

ACG 
Thr 



CAA 
Gin 



GGC 
Gly 



CCC 
Pro 



AGG 
Arg 



ACG 
Thr 



ATA 
He 



CTT 
Leu 



GAC 
Asp 



GCC 
Ala 



AAC ATC 
Asn He 
240 

CTG AGC 
Leu Ser 
255 



CTG TGA 
Leu 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 ... 792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG GAA AAC TTA AAA AAG TAC CTA GAA GTT GCA AAA ATA GCC GCG CTC 
Met Glu Asn Leu Lys Lys Tyr Leu Glu Val Ala Lys He Ala Ala Leu 
15 10 15 

GCG GGT GGG CAG GTT CTG AAA GAA AAC TTC GGA AAG GTA AAA AAG GAA 
Ala Gly Gly Gin Val Leu Lys Glu Asn Phe Gly Lys Val Lys Lys Glu 

20 25 30 

AAC ATA GAG GAA AAA GGG GAA AAG GAC TTT GTA AGT TAC GTG GAT AAA 
Asn He Glu Glu Lys Gly Glu Lys Asp Phe Val Ser Tyr Val Asp Lys 
35 40 45 

ACT TCA GAG GAA AGG ATA AAG GAG GTG ATA CTC AAG TTC TTT CCC GAT 
Thr Ser Glu Glu Arg He Lys Glu Val He Leu Lys Phe Phe Pro Asp 
50 55 60 

CAC GAG GTC GTA GGG GAA GAG ATG GGT GCG GAG GGA AGC GGA AGC GAA 
His Glu Val Val Gly Glu Glu Met Gly Ala Glu Gly Ser Gly Ser Glu 
65 70 75 80 

TAC AGG TGG TTC ATA GAC CCC CTT GAC GGC ACA AAG AAC TAC ATA AAC 
Tyr Arg Trp Phe lie Asp Pro Leu Asp Gly Thr Lys Asn Tyr lie Asn 

85 90 95 

GGT TTT CCC ATC TTT GCC GTA TCA GTG GGA CTT GTT AAG GGA GAA GAG 
Gly Phe Pro He Phe Ala Val Ser Val Gly Leu Val Lys Gly Glu Glu 

100 105 HO 

CCA ATT GTG GGT GCG GTT TAC CTT CCT TAC TTT GAC AAG CTT TAC TGG 
Pro lie Val Gly Ala Val Tyr Leu Pro Tyr Phe Asp Lys Leu Tyr Trp 
115 120 125 



GGT GCT AAA GGT CTC GGG GCT TAC GTA AAC GGA AAG AGG ATA AAG GTA 



Gly Ala Lys Gly Leu Gly 
130 

AAG GAC AAT GAG AGT TTA 
Lys Asp Asn Glu Ser Leu 
145 150 

TCT AGG AGC AGG AGG GAC 
Ser Arg Ser Arg Arg Asp 

165 

GTC TTT TAC GAA GTT GGC 
Val Phe Tyr Glu Val Gly 

180 

GAC CTC TGC ATG GTG GCG 
Asp Leu Cys Met Val Ala 
195 

GAA ATG AAG CCG TGG GAC 
Glu Met Lys Pro Trp Asp 
210 

GCC GGG GGC GTT TAC ACA 
Ala Gly Gly Val Tyr Thr 
225 230 

ATA ATT GCG GGC AAC AAA 
lie He Ala Gly Asn Lys 

245 

AAA AAG TAT ATG GAA GTG 
Lys Lys Tyr Met Glu Val 

260 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Met 


Arg 


Gly 


Ser 


Gly 


Val 


Arg 


He 


Leu 


Leu 


Thr 


Asn 


Asp 


Asp 


Gly 


He 


1 






5 










10 










15 


Ala 


Phe 


Ala 


Glu 


Gly 


Leu 


Gly 


Ala 


Leu 


Arg 


Lys 


Met 


Leu 


Glu 


Pro 


Val 








20 










25 










30 






Thr 


Leu 


Tyr 


Val 


Val 


Ala 


Pro 


Asp 


Arg 


Glu 


Arg 


Ser 


Ala 


Ala 


Ser 


His 






35 










40 










45 








Ala 


He 


Thr 


Val 


His 


Arg 


Pro 


Leu 


Arg 


Val 


Arg 


Glu 


Ala 


Gly 


Phe 


Arg 




50 










55 










60 










Ser 


Pro 


Arg 


Leu 


Lys 


Gly 


Trp 


Val 


Val 


Asp 


Gly 


Thr 


Pro 


Ala 


Asp 


Cys 


65 








70 










75 










80 


Val 


Lys 


Leu 


Gly 


Leu 


Glu 


Val 


Leu 


Leu 


Pro 


Glu 


Arg 


Pro 


Asp 


Phe 


Leu 






85 










90 










95 




Val 


Ser 


Gly 


He 


Asn 


Tyr 


Gly 


Pro 


Asn 


Leu 


Gly 


Thr 


Asp 


Val 


Leu 


Tyr 






100 










105 










110 






Ser 


Gly 


Thr 


Val 


Ser 


Ala 


Ala 


He 


Glu 


Gly 


Val 


He 


Asn 


Gly 


lie 


Pro 




115 










120 










125 








Ser 


Val 


Ala 


Val 


Ser 


Leu 


Ala 


Thr 


Arg 


Arg 


Glu 


Pro 


Asp 


Tyr 


Thr 


Trp 



Ala Tyr Val Asn Gly Lys Arg He Lys Val 
135 140 

AAG CAC GCC GGA GTG GTT TAC GGA TTT CCC 
Lys His Ala Gly Val Val Tyr Gly Phe Pro 

155 160 



480 



ATA TCT ATC TAC TTG AAC ATA TTC AAG GAT 
He Ser He Tyr Leu Asn He Phe Lys Asp 

170 175 

TCT ATG AGG AGA CCC GGG GCT GCT GCG GTT 
Ser Met Arg Arg Pro Gly Ala Ala Ala Val 
185 190 

GAA GGG ATA TTT GAC GGG ATG ATG GAG TTT 
Glu Gly He Phe Asp Gly Met Met Glu Phe 
200 205 

ATA ACC GCA GGG CTT GTA ATA CTG AAG GAA 
He Thr Ala Gly Leu Val He Leu Lys Glu 
215 220 

CTT GTG GGA GAA CCC TTC GGA GTT TCG GAC 
Leu Val Gly Glu Pro Phe Gly Val Ser Asp 

235 240 



528 



576 



624 



672 



720 



GCC CTC CAC GAC TTT ATA CTT CAG GTA GCC 
Ala Leu His Asp Phe He Leu Gin Val Ala 

250 255 



768 



GCG GTG TGA 
Ala Val 



795 





130 






Ala 


Ala 


Arg 


Phe 


145 








Leu 


Pro 


Pro 


Gly 


Arg Gly 


Val 


Lys 








1 o u 


Val 


Asp 


Cys 


Arg 






195 




Gly 


Glu 


Pro 


Leu 




210 






Ala 


val 


Arg 


Glu 


225 








Thr 


Asn 


Tyr 


Gly 


lie 


Phe 


Ser 


Ser 








260 



135 

Val Leu Val Leu 
150 

Thr Leu Leu Asn 
165 

Val Thr Lys Leu 

Thr Asp Pro Arg 

200 

Glu Leu Asp Gly 
215 

Gly Tyr lie Ser 
230 

Phe Leu Glu Glu 
245 









14 0 


Leu 


Glu 


Glu 


Leu 






155 




Val 


Asn 


Val 


Pro 




170 






Gly 


Ser 


Val 


Arg 


185 








Gly 


Lys 


Ala 


Tyr 


Asn 


Asp 


Ser 


Glu 








220 


Val 


Thr 


Pro 


Val 






235 




Leu 


Lys 


Lys 


Trp 



250 



Arq 


Lys 


His 


Gin 








160 


Asp 


Gly 


Val 


Pro 






175 




Tyr 


val 


Asn 


val 




190 






Tyr 


Trp 


Met 


Ala 


205 








Thr 


Asp 


Val 


Trp 


Gin 


lie 


Asp 


Leu 








240 


Arg 


Phe 


Lys 


Asp 






255 





(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Met 


Leu 


Asp 


He 


Leu 


Leu 


Val 


Asn 


Asp 


Asp 


Gly 


He 


Tyr 


Ser 


Asn 


Gly 


1 






5 










10 










15 




Leu 


He 


Ala 


Leu 


Lys 


Asp 


Ala 


Leu 


Leu 


Glu 


Lys 


Phe 


Asn 


Ala 


Arg 


lie 








20 






25 










30 






Thr 


He 


Val 


Ala 


Pro 


Thr 


Asn 


Gin 


Gin 


Ser 


Gly 


He 


Gly 


Arg 


Ala 


lie 






35 










40 










45 








Ser 


Leu 


Phe 


Glu 


Pro 


Leu 


Arg 


He 


Thr 


Lys 


Thr 


Lys 


Leu 


Ala 


Asp 


Gly 




50 










55 










60 










Ser 


Trp 


Gly 


Tyr 


Ala 


Val 


Ser 


Gly Thr 


Pro 


Thr 


Asp 


Cys 


Val 


He 


Leu 


65 






70 










75 










80 


Gly 


He 


Tyr 


Glu 


He 


Leu 


Lys 


Lys 


Val 


Pro 


Asp 


Val 


Val 


He 


Ser 


Gly 






85 










90 










95 




lie 


Asn 


He 


Gly 


Glu 


Asn 


Leu 


Gly Thr 


Glu 


He 


Thr 


Thr 


Ser 


Gly 


Thr 








100 










105 










110 






Leu 


Gly 


Ala 


Ala 


Phe 


Glu 


Gly 


Ala 


His 


His 


Gly 


Ala 


Lys 


Ala 


Leu 


Ala 




115 










120 










125 








Ser 


Ser 


Leu 


Gin 


Val 


Thr 


Ser 


Asp 


His 


Leu 


Lys 


Phe 


Lys 


Glu Gly 


Glu 




130 










135 










140 










Thr 


Pro 


He 


Asp 


Phe 


Thr 


Val 


Pro 


Ala 


Arg 


He 


Thr 


Ala 


Asn 


Val 


Val 


145 








150 










155 










160 


Glu 


Lys 


Met 


Leu 


Asp 


Tyr 


Asp 


Phe 


Pro 


Cys 


Asp 


Val 


Val 


Asn 


Leu 


Asn 








165 










170 










175 




He 


Pro 


Glu 


Gly 


Ala 


Thr 


Glu 


Lys 


Thr 


Pro 


He 


Glu 


lie 


Thr 


Arg 


Leu 








180 










185 










190 






Ala 


Arg 


Lys 


Met 


Tyr 


Thr 


Thr 


His 


Val 


Glu 


Glu 


Arg 


lie 


Asp 


Pro 


Arg 




195 










200 










205 








Gly 


Arg 


Ser 


Tyr 


Tyr 


Trp 


He 


Asp 


Gly 


Tyr 


Pro 


He 


Leu 


Glu 


Glu 


Glu 




210 










215 










220 










Glu 


Asp 


Thr 


Asp 


Val 


Tyr 


Val 


Val 


Arg 


Arg 


Lys 


Gly His 


He 


Ser 


Leu 


225 






230 










235 










240 


Thr 


Pro 


Leu 


Thr 


Leu 


Asp 


Thr 


Thr 


He 


Lys 


Asn 


Leu 


Glu 


Glu 


Phe 


Lys 










245 








250 










255 




Lys 


Lys 


Tyr 


Glu 


Arg 


He 


Leu 


Asn 


Glu 




















260 










265 

















(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 254 amino acids 
(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 30: 



Met 


Met 


Met 


Glu 


Phe 


Thr 


Arq 

-3 


Glu 


Gly 


He 


Lys 


Ala 


Ala 


Val 


Glu 


Ala 


"1 
_L 








5 








10 










15 




Leu 


Gin 


Gly 


Leu 


Gly 


Glu 


He 


Tyr 


Val 


Val 


Ala 


Pro 


Met 


Phe 


Gin 


Arg 






20 








25 










30 






Ser 


Ala 


Ser 


Gly 


Arq 


Ala 


Met 


Thr 


lie 


His 


Arg 


Pro 


Leu 


Arg 


Ala 


Lys 






35 






40 










45 








Aro 


He 


Ser 


Met 


Asn 


Gly 


Ala 


Lys 


Ala 


Ala 


Tyr 


Ala 


Leu 


Asp 


Gly 


Met 


50 










55 










60 










Pro 


Val 


Asp 


Cys 


Val 


lie 


Phe 


Ala 


Met 


Ala 


Arg 


Phe 


Gly 


Asp 


Phe 


Asp 


65 






70 










75 










80 


Leu 


Ala 


He 


Ser 


Gly 


Val 


Asn 


Leu Gly 


Glu 


Asn 


Met 


Ser 


ihr 


Glu 


He 










85 










90 










95 




Thr 


Val 


Ser 


Gly 


Thr 


Ala 


Ser 


Ala 


Ala 


He 


Glu 


Ala 


Ala 


rnr 


Gin 


Glu 








100 










105 










110 






He 

J- -J_ 


Pro 


Ser 


He 


Pro 


He 


Ser 


Leu 


Glu 


Val 


Asn 


Arg 


Glu 


Lys 


His 


Lys 






115 










120 










125 








Phe 


Gly Glu 


Gly 


Glu 


Glu 


lie 


Asp 


Phe 


Ser 


Ala 


Ala 


Lys 


Tyr 


Phe 


Leu 




130 










135 










14 0 










Arg 


Lys 


He 


Ala 


Thr 


Ala 


Val 


Leu 


Lys 


Arg 


Gly 


Leu 


Pro 


Lys 


Gly Val 


145 








150 










1 t~ V 

155 










160 


Asp 


Met 


Leu 


Asn 


Val 


Asn 


Val 


Pro 


Tyr 


Asp 


Ala 


Asn 


Glu 


Arg 


Thr 


Glu 








165 










170 










175 




He 


Ala 


Phe 


Thr 


Arg 


Leu 


Ala 


Arg 


Arg 


Met 


Tyr 


Arg 


Pro 


Ser 


He 


Glu 








180 








185 










190 








Arg 


lie 


Asp 


Pro 


Lys 


Gly 


Asn 


Pro 


Tyr 


Tyr 


Trp 


He 


Val 


Gly 


Thr 




195 










200 










205 








m n 


Cys 


Pro 


Lys 


Glu 


Ala 


Leu 


Glu 


Pro 


Gly 


Thr 


Asp 


Met 


Tyr 


Val 


Val 




210 








215 










220 










Lys 


Val 


Glu 


Arg 


Lys 


Val 


Ser 


Val 


Thr 


Pro 


He 


Asn 


He 


Asp 


Met 


Thr 


225 








230 










235 










240 


Ala 


Arg 


Val 


Asn 


Leu 


Asp 


Glu 


He 


Lys 


Arg 


Leu 


Leu 


Glu 


Leu 












245 










250 


















(2) INFORMATION FOR SEQ ID 


NO:! 


31 : 














(i) SEQUENCE CHARACTERISTICS: 




















(A) 


LENGTH : 


271 


amino acids 




















(B) 


TYPE : amino 


acid 






















(D) 


TOPOLOGY: linear 




















(ii) 1 


MOLECULE 


TYPE: protein 


















(v) FRAGMENT ' 


rYPE 


: internal 


















(xi) . 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


31 : 










Met 


Arg 


Thr 


Leu 


Thr 


lie 


Asn 


Thr 


Asp 


Ala 


Glu 


Gly 


Phe 


Val 


Leu 


Arg 


1 






5 










10 










15 




He 


Leu 


Leu 


Thr 


Asn 


Asp 


Asp 


Gly 


He 


Tyr 


Ser 


Asn 


Gly 


Leu 


Arg 


Ala 








20 










25 










30 






Ala 


Val 


Lys 


Ala 


Leu 


Ser 


Glu 


Leu 


Gly 


Glu 


Val 


Tyr 


Val 


Val 


Ala 


Pro 







35 




Leu 


Phe 


Gin 


Arg 




50 






lie 


Arg 


TV 1 — 

Ala 


Lys 


65 








lie 


Asp 


pi , r 

vjiy 


1 ilL 


Gly 


Ser 


Phe 


Gly 








100 


her 


iui 


Hi M 

biU 


lie 






115 




Ser 


Thr 


His 


Gly 




I --i 

130 






Lys 


Lys 


Thr 


Leu 


145 








nlS 


Pne 


Leu 


Lys 


Glu 


Gly 


Val 


Asp 








180 


GlU 


Thr 


GlU 


TI- 
116 






195 




Thr 


Val 


Glu 


Glu 




210 






Val 


Gly 


Lys 


Leu 


225 








Leu 


Lys 


Val 


Glu 


Thr 


Ala 


Arg 


Val 








260 



Ser 


Ala 


Ser 


Gly 






c c 

b b 




Arg 


val 


Asp 


val 




n n 
1 u 






y~\ 

Pro 


l nr 


Asp 


L.ys 


ft c: 








Leu 


nia 


Vdl 




Thr 


Val 


Ser 


Gly 








ion 


lie 


pro 


t»er 


Tift 

lie 






lie, 

i j j 




Gly 


GlU 


Gly 


(jlU 




lbu 






Arg 


lie 


Ala 


pi ,, 

biy 


1 0 D 








wee 


Leu 


Asn 


vai 


Ala 


lie 


Thr 


Arg 








200 


Arg 


He 


Asp 


Pro 






215 




Val 


Gin 


Asp 


Phe 




230 






Arg 


Lys 


Val 


Ser 


245 








Asp 


Phe 


Glu 


Glu 



Arg 


Ala 


Met 


Thr 








c n 
o V 


n -v*» 

Pro 


pi,, 

oiy 


a.i a 


bys 












T 1 - 

lie 


rne 


Mid 




q n 






nl if 
Lj! y 


Tip 
lie 


ZV c n 




1 U O 








i fix 


HI d 


Cor 


Til a 
nio 


Ala 


lie 


Ser 


Leu 










(jiy 


vai 


"A 

ASp 


pne 






155 






jjeu 


Leu 


Glu 




1 / V 








vai 


Pro 


Ser 


1 ft 

-L Q _J 








Leu 


Ala 


Arg 


Lys 


Lys 


Gly 


Asn 


Pro 








220 


Glu 


Pro 


Gly Thr 






235 




Val 


Thr 


Pro 


He 




250 






Leu 


Val 


Arg 


Val 



265 



45 

Leu His Arg Pro 

He Ala Tyr Gly 

80 

lie Ala Arg Phe 
95 

Gly Glu Asn Leu 
110 

Ala He Glu Ala 
125 

Glu Val Glu Trp 

Ser Val Ser Thr 

160 

Arg Gly Leu Pro 
175 

Asp Ala Thr Glu 
190 

Arg Tyr Ser Pro 
205 

Tyr Tyr Trp He 

Asp Ala Tyr Ala 

240 

Asn He Asp Met 
255 

Leu Trp Val 
270 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 497 amino acids 
{B} TYPE; amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Met 


Lys 


Gly 


Lys 


Ser 


Leu 


Val 


Ser 


Gly 


Leu 


Leu 


Leu 


Gly 


Leu 


Leu 


He 


1 








5 










10 










15 




Leu 


Ser 


Leu 


He 


Ser 


Phe 


Gin 


Pro 


Ser 


Phe 


Ala 


Tyr 


Ser 


Pro 


His 


Gly 








20 










25 










30 






Gly Val 


Lys 


Asn 


He 


He 


He 


Leu 


Val 


Gly 


Asp 


Gly 


Met 


Gly 


Leu 


Gly 






35 










40 










45 








His 


Val 


Glu 


He 


Thr 


Lys 


Leu 


Val 


Tyr 


Gly 


His 


Leu 


Asn 


Met 


Glu 


Asn 




50 










55 










60 










Phe 


Pro 


Val 


Thr 


Gly 


Phe 


Glu 


Leu 


Thr 


Asp 


Ser 


Leu 


Ser 


Gly 


Glu 


Val 


65 










70 










75 










80 


Thr 


Asp 


Ser 


Ala 


Ala 


Ala 


Gly 


Thr 


Ala 


He 


Ser 


Thr 


Gly Ala 


Lys 


Thr 








85 










90 










95 




Tyr 


Asn 


Gly 


Met 


He 


Ser 


Val 


Thr 


Asn 


He 


Thr 


Gly 


Lys 


He 


Val 


Asn 








100 










105 










110 






Leu 


Thr 


Thr 


Leu 


Leu 


Glu 


Val 


Ala 


Gin 


Glu 


Leu 


Gly 


Lys 


Ser 


Thr 


Gly 






115 










120 










125 








Leu 


Val 


Thr 


Thr 


Thr 


Arg 


He 


Thr 


His 


Ala 


Thr 


Pro 


Ala 


Val 


Phe 


Ala 




130 










135 










140 










Ser 


His 


Val 


Pro 


Asp 


Arg 


Asp 


Met 


Glu 


Gly 


Glu 


He 


Pro 


Lys 


Gin 


Leu 


145 










150 










155 










160 


He 


Met 


His 


Lys 


Val 


Asn 


Val 


Leu 


Leu 


Gly 


Gly 


Gly 


Arg 


Glu 


Lys 


Phe 



165 170 175 





Glu 


Lvs 


Asn 


Leu 


Glu 


Leu 


Ala 








1 R0 










Phe 


Thr 


Lys 


Glu 


Glu 


Leu 


Glu 


Lys 






195 

~J_ 










200 


Leu 

JuJ V|A 


Phe 


Ala 


Glu 


Ser 


His 


He 


Pro 




_L W 










215 




Asp 


Val 


Gly 


Leu 


Leu 


Glu 


Met 


Ala 


225 










230 






Lys 


Asn 


Pro 


Ser 


Gly 


Phe 


Phe 


Leu 










245 








His 


Ala 


Ala 


His 


Gly 


Asn 


Asp 


Val 








260 










Glu 


Phe 


Asp 


Asp 


Val 


Val 


Arg 


Tvr 






275 










280 


Gly Asp 


Thr 


Leu 


Val 


He 


Val 


Leu 




290 










2 95 




Ala 


He 


Gly Leu 


Thr 


Tvr 

i y J. 


Glv 


Asn 


305 










310 






Lys 


He 


Lys 


Ala 


Ser 


Thr 


Leu 


Arg 










325 








Ser 


Ser 


Val 


Lys 


Glu 


Ser 


Ser 


Lys 








340 










Gly Arg 


Ser 


Gin 


Tvr 


lie 


Glu 


Asn 






355 










360 


Ala 


Leu 


Ser 


Asn 


Ala 


Val 


Ala 


Asp 




370 










375 




Gly 


Phe 


Thr 


Ser 


Tvr 


Glu 


His 


Thr 


385 










3 90 

J *J V 






Tyr Gly 


Pro 


Gly 


Ala 


Glu 


Asn 


Phe 










405 

T w J 








Thr 


Ala 


Arg 


Leu 


Val 


Ala 


Lys 


Leu 








420 










Pro 


Val 


Thr 


He 


C fs y 




Val 

V d X 


Ser 






435 










440 


Asp 


Tyr 


Arg 


Val 


Asp 


Glu 


Lys 


Asp 




450 










455 




Leu 


Gly 


Glu 


Lys 


Val 


Asp 


Asn 


Glu 


465 










470 






Asn 


Asn 


Gly Met 


Val 


Asp 


Leu 


Asn 










485 








Ala 

















Lvs 


Lvs 


Gin 


Gly Tyr 


Lys 


Val 


Val 


185 










190 






Val 

V VA 


Glu 

w -4- \A 


Gly Asp 


Tyr 


Val 


Leu 


Gly 










205 








Tvr 


Val 


Leu 


ASD 

■1*. *J f-* 


Arg 


Lys 


Pro 


Asp 








220 










Lys 


Lys 


Ala 


He 


Ser 


He 


Leu 


Glu 






235 










240 


Met 


Val 


Glu 


Glv 


Gly Arg 


He 


Asp 




250 










255 




Ala 


Ser 


Val 


Val 


Ala 


Glu 


Thr 


Lys 


265 

u u 










270 






Val 


Leu 


Glu 


Tvr 


Pro 


Lys 


Lys 


Arg 










285 








Ala 


Asp 


His 


Glu 


Thr Gly 


Gly 


Leu 








300 










Ala 


lie 




Glu 


Asp 


Ala 


He 


Arg 






315 










320 


Met 


Pro 


TiVS 


Glu 


Val 


Lys 


Ala 


Gly 




3 30 

O W 










335 




Val 


Cys 




He 


Cys 


Pro 


Asn 


Arg 


345 










350 






Ala 


Leu 


Hi s 


Ser 


Thr 


Asn 


Lys 


Tyr 










365 








Val 


He 




Arg 


Arg 


He 


Gly 


Val 








380 










Glv 


Val 


Prn 

IT .L VJ 


Val 


Pro 


Leu 


Leu 


Ala 




395 










400 


Aro 


Glv 




Leu 


His 


His 


Val 


Asp 




410 

7 -L v 










415 




Met 


Leu 


Phe 


Gly Arg Arg 


Asn 


lie 












430 






Ser 


Val 


Lys 


Gly Asp 


lie 


Thr 


Gly 










445 








Ala 


Tyr 


Val 


Thr 


Leu 


Met 


Met 


Phe 






460 










He 


Glu 


Lys 


Arg 


Val 


Asp 


He 


Asp 






475 










480 


Asp 


Val 


Met 


Leu 


He 


Leu 


Gin 


Glu 


490 










495 





(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 33: 

Met Pro Arg Asn lie Ala Ala Val Cys Ala Leu Ala Ala Leu Leu Gly 

15 10 15 

Ser Ala Trp Ala Ala Lys Val Ala Val Tyr Pro Tyr Asp Gly Ala Ala 

20 25 30 

Leu Leu Ala Gly Gin Arg Phe Asp Leu Arg lie Glu Ala Ser Glu Leu 

35 40 45 

Lys Gly Asn Leu Lys Ala Tyr Arg lie Thr Leu Asp Gly Gin Pro Leu 

50 55 60 

Ala Gly Leu Glu Gin Thr Ala Gin Gly Ala Gly Gin Ala Glu Trp Thr 



Leu 


Arg 


Gly 


Ala 


Leu 


Thr 


Asp 


Asp 








100 


Arg 


Gin 


Asn 


Leu 






115 




He 


Gly 


Asp 


Gly 




130 






Ala 


Lys 


Gly 


Phe 


145 








He 


Glu 


Ser 


Gly 


Asp 


Ser 


Phe 


lie 








180 


Gly Gin 


Lys 


Val 






195 




Lys 


Asp 


Thr 


Leu 




210 






Lys 


Arg 


Val 


Arg 


225 








Thr 


Asp 


Ala 


Thr 


Asp 


Tyr 


Gin 


Ala 








260 


Val 


Pro 


Leu 


Asp 






275 




Gin 


Ser 


Thr 


Pro 




290 






Glu 


Ser 


Gin 


Lys 


305 








Leu 


Ala 


Ala 


Lys 


Asn 


Phe 


Pro 


Ser 








340 


Leu Gly 


Ser 


Phe 






355 




Ala 


Val 


Glu 


Ala 




370 






Glu Gly 


Gly 


Met 


385 








Ala 


Leu 


Trp 


Asp 


Gly Tyr 


Ala 


Ala 








420 


His 


Ala 


His 


Ser 






435 




Gly Arg 


Glu 


Gly 




450 






Gly Asp 


Lys 


Lys 


465 








Gly 


He 


Ala 


Val 


Arg Gly 


Arg 


Glu 








500 


Gly Tyr 


Val 


Ala 






515 




Tyr 


Arg 


Gin 


Leu 




530 






Pro 


Met 


Pro 


Leu 


545 








Leu 


He 


Asp 


Gin 


Phe 


Asn 


Pro 


His 








580 





70 






Phe 


Leu 


Arg 


Pro 


85 








Ala 


Gly 


Glu 


Ser 


Arg 


Leu 


T") -v** 


Arg 








120 


Met 


Gly 


Trp 


Asn 






135 




Asn 


Pro 


Glu 


Asn 




150 






Tyr 


Gly 


Gly 


Met 


165 








Ala 


Asp 


Ser 


Ala 




vai 


Asn 


Ala 

Ala 








200 


Ala 


Tyr 


Pro 


Arg 






215 




Gly 


Ala 


Ser 


lie 




230 






Pro 


Ala 


Ser 


Leu 


245 








lie 


Ala 


Asp 


Met 


IT-, "1 

va i 


riec 




irllG 








280 


Gly 


Ser 


Arg 


Arg 






295 




Leu 


Gly 


Tyr 


Thr 




310 






Pro 


Thr 


Asp 


Lys 


325 








Tyr 


Leu 


Asp 


Arg 


Thr 


Asp 


Met 


Pro 








360 


Leu 


Ser 


Arg 


Asn 






375 




Val 


Asp 


Lys 


Tyr 




390 






Val 


Leu 


Glu 


Leu 


405 








Ser 


His 


Pro 


Asp 


lie 


O d. 

be! 


va J. 


rile 








440 


Val 


Gly 


Val 


Tyr 






455 




Asp 


Ala 


Asn 


Gly 




470 






Gly 


Phe 


Gly 


Ala 


485 








Val 


Tyr 


Lys 


Asp 


Asn 


Pro 


Glu 


Val 








520 


Pro 


Val 


Asp 


Ser 






535 




Phe 


Ala 


Phe 


Gly 




550 






Thr 


Glu 


He 


Phe 


565 








Leu 


Glu 


Lys 


Pro 







75 




Gly 


Ser 


His 


Thr 




90 






Arg 


Lys 


Ser 


Val 


105 








Ala 


Ala 


Lys 


Asn 


inr 


T ^ T 7 

ijeu. 


i\b 11 


a i a 

n.i d 








140 


Gly 


Met 


Pro 


Asn 






155 




Ala 


Thr 


Val 


Thr 




170 






Asn 


Ser 


Ala 


Ser 


185 








Leu 


Asn 


Val 


Tyr 


H ^ 




1 II X- 










220 


Gly 


Val 


Val 


Thr 






235 




Asn 


Ala 


His 


Thr 




250 






Tyr 


Phe 


Gly 


Arg 


265 








Gly 


Gly 


Ser 


Arg 






OCX. 


TV) y 








300 


Phe 


Val 


Ser 


Thr 






315 




Leu 


Phe 


Gly 


Leu 




330 






Ala 


Val 


Trp 


Lys 


345 








Tyr 


Leu 


Trp 


Glu 




Liy b 


ijiy 


c lie 








380 


Glu 


His 


Pro 


Leu 






395 




Asp 


Arg 


Ala 


Val 




410 






Thr 


Leu 


Val 


He 


425 








Gly 


Gly 


Tyr 


Asp 


tjlU 


Aia 


A J. a 


T T/C 

Xjy & 








460 


Phe 


Pro 


Leu 


Pro 






475 




Thr 


Pro 


Asp 


Tyr 




490 






Pro 


Thr 


He 


Ser 


505 








Cys 


Lys 


Glu 


Pro 


Ala 


Gin 


Gly 


Val 








540 


Val 


Gly 


Ser 


Gin 






555 




Phe 


Arg 


Met 


Ala 




570 













80 


Leu 


Glu 


Val 


Ser 






95 




Arg 


Trp 


Glu 


Ala 




110 






Val 


lie 


Leu 


Phe 


125 








Ala 


Arg 


He 


lie 


\j _l y 


■rvoll 












160 


Thr 


Gly 


Ser 


Phe 






175 




Ser 


He 


Met 


Thr 




190 






Pro 


Ser 


Asn 


Leu 


205 








Ala 


Glu 


Met 


Leu 


Thr 


Thr 


Phe Gly 








240 


Arg 


Arg 


Arg Gly 






255 




Gly 


Gly 


Phe 


Gly 




270 






Asp 


Phe 


He 


Pro 


285 








Asp 


Trp 


Tl A 

J. J. tr 






OCI 


Glu 


Leu 








320 


Phe 


Asn 


He 


Asp 






335 




Arg 


Pro 


Glu 


Met 




350 






Met 


Thr 


Gin 


Lys 


365 








Phe 


Leu 


Met 


Val 


Afap 




Pro 


Arg 








400 


Ala 


Trp 


Ala 


Lys 






415 




Val 


Thr 


Ala 


Asp 




430 






Tyr 


Ser 


Lys 


Gin 


445 








Phe 


Pro 


Thr 


Tyr 




X 111. 


Thr 


Arg 








480 


Cys 


Glu 


Thr 


Tyr 






495 




Asp 


Gly 


Lys 


Gly 




510 






Gly 


Leu 


Pro 


Thr 


525 








His 


Thr 


Ala 


Asp 


Phe 


Phe 


Asn 


Gly 








560 


Gin 


Ala 


Leu 


Gly 






575 





(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



Met 


Tyr 


Lvs 


Trr> 


lie 


He 


Glu 


Gly Lys 


Leu 


Ala 


Gin 


Ala 


Pro 


Phe 


Pro 


1 




5 










10 










15 




Ser 


Leu 


Glv 


Glu 


Leu 


Ala 


Asp 


T iPii 

<LJ \A 


Lvs 


Arg 


Leu 


Phe 


Asp 


Ala 


He 


He 






20 










25 










30 






Val 


Leu 


Thr 


Met 


Pro 


His 


Glu 


Gin 


Pro 


Leu 


Asn 


Glu 


Lys 


Tyr 


lie 


Glu 






3 5 










4 0 










45 








lie 


Leu 


Glu 




His 


Glv 


Phe 


Gin 


Val 


Leu 


His 


Val 


Pro 


Thr 


Leu 


Asp 




50 










55 










60 










Phe 


His 


Pro 


Leu 


Glu 


Leu 


Phe 


Aso 


Leu 

4—1 *w u- 


Leu 


Lys 


Thr 


Ser 


He 


Phe 


lie 


65 










70 










75 










80 


Asp 


Glu 


Asn 


Leu 


Glu 


Arq 


Ser 


His 


Ara 


Val 


Leu 


Val 


His 


Cys 


Met 


Gly 








85 










90 










95 




Gly 


lie 


Glv 


Aror 


Ser 


Gly 


Leu 


Val 


Thr 


Ala 


Ala 


Tyr 


Leu 


lie 


Phe 


Lys 






100 










105 










110 






Gly 


Tyr 


ASD 


He 


Tyr 


Asp 


Ala 


Val 


Lvs 


His 


Val 


Arg 


Thr 


Val 


Val 


Pro 




115 










120 










125 








Gly Ala 


lie 


Glu 


Asn 


Arg 


Gly 


Gin 


Ala 


Leu 


Met 


Leu 


Glu 


Asn 


Tyr 


Tyr 




130 










135 










140 










Thr 


Leu 


Val 


Lys 


Ser 


Phe 


Asn 


Arg 


Glu 


Leu 


Leu 


Arg 


Asp 


Tyr 


Gly 


Lys 


145 








150 










155 










160 


Lys 


lie 


Phe 


Thr 


Leu 


Gly 


Asp 


Pro 


Lys 


Ala 


Val 


Leu 


His 


Ala 


Ser 


Lys 








165 










170 










175 




Thr 


Thr 


Gin 


Phe 


Thr 


He 


Glu 


Leu 


Leu 


Ser 


Asn 


Leu 


His 


Val 


Asn 


Glu 








180 










185 










190 






Ala 


Phe 


Ser 


He 


Ser 


Ala 


Met 


Ala 


Gin 


Ser 


Leu 


Leu 


His 


Phe 


His 


Asp 






195 










200 










205 








Val 


Lys 


Val 


Arg 


Ser 


Lys 


Leu 


Lys 


Glu 


Val 


Phe 


Glu 


Asn 


Met 


Glu 


Phe 




210 










215 










220 










Ser 


Ser 


Ala 


Ser 


Glu 


Glu 


Val 


Leu 


Ser 


Phe 


He 


His 


Leu 


Leu 


Asp 


Phe 


225 










230 










235 










240 


Tyr 


Gin 


Asp 


Gly 


Arg 


Val 


Val 


Leu 


Thr 


He 


Tyr Asp 


Tyr 


Leu 


Pro 


Asp 








245 










250 










255 




Arg 


Val 


Asp 


Leu 


He 


Leu 


Leu 


Cys 


Lys 


Trp 


Gly 


Cys 


Asp 


Lys 


He 


Val 






260 










265 










270 






Glu 


Val 


Ser 


Ser 


Ser 


Ala 


Lys 


Lys 


Thr 


Val 


Glu 


Lys 


Leu 


Val 


Gly Arg 






275 










280 










285 








Lys 


Val 


Ser 


Leu 


Ser 


Trp 


Ala 


Asn 


Tyr 


Leu 


Asp 


Tyr 


Val 








290 










295 










300 











(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 35: 



Met 


Arg 


lie 


Leu 


Leu 

c 


Tnr 


Asn 


Asp 


l 

Arg 


Ala 


Ala 


Val 


D 

Lys 


Gly 


Leu 


Ser 








20 










Ala 


Pro 


Leu 


Phe 


Gin 


Arg 


Ser 


Ala 






35 










4 0 


Arg 


Pro 


He 


Arg 


Ala 


Lys 


Arg 


val 




bl> 










D _) 




Tyr 


Gly 


He 


Asp 


Gly 


Tnr 


Pro 


I nr 


65 










/ 0 






Arg 


Phe 


Gly Asp 


Pne 


Asp 


Leu 


Ala 










o b 








Asn 


Leu 


Ser 


Thr 


Glu 


He 


Thr 


vai 








100 










Glu 


Ala 


Ser 


Thr 


His 


Gly 


lie 


T"N UK j*. 

Pro 






115 










12 0 


Glu 


Trp 


Lys 


Lys 


Thr 


Leu 


Gly 


Glu 




*1 1 A 










1j J 




Ser 


Ala 


His 


Phe 


Leu 


Arg 


Arg 


T 1 * 

lie 


1 A C 

14 5 










lb U 






T — . _ 

Leu 


Pro 


Glu 


Gly 


val 


Asp 


Met 


Leu 










Xb b 








Ser 


Glu 


Gly Thr 


Glu 


He 


Ala 


He 








180 










Ser 


Pro 


Thr 


He 


Glu 


Glu 


Arg 


He 






195 










200 


Trp 


lie 


Val 


Gly 


Arg 


Leu 


Val 


Gin 




210 










215 




Tyr 


Ala 


Leu 


Lys 


Val 


Glu 


Arg 


Lys 


225 










230 






Asp 


Met 


Thr 


Ala 


Arg 


Val 


Asp 


Phe 










245 









Leu 



Asp 


Gly 


He 


Tyr 


J~* -nW 

Ser 


Asn Gly 


Leu 




1U 










15 




Glu 


Leu 


Gly Glu 


val 


Tyr 


Val 


Val 


25 










30 






Ser 


Gly 


Arg 


Mia 


Met 


Thr 


Leu 


His 










4 b 








Asp 


val 


fro 


HI IT 

Giy 


Aia 


Lys 


He 


Ala 








60 










Asp 


cys 


Vdi 


ne 


f ne 


Ala 


He 


Ala 






7 b 










80 


Val 


ser 


uiy 


11c 


Asn 


Leu Gly Glu 














95 




ser 


biy 


I ill 


a") a 


O rt v- 

ber 


Ala 


Ala 


lie 


x u b 










110 






Ser 


val 


Aia 


lie 


ber 


Leu 


Glu 


Val 










IOC 








Gly 


GlU 


c?iy 


lie 


Asp 


Phe 


Ser 


Val 








14 U 










Ala 


Thr 


Ala 


vai 


Leu 


Lys 


Lys 


Gly 






155 










160 


Asn 


val 


Asn 


Val 


Pro 


Ser 


Asp 


Ala 




1 / u 










175 




nil*. 

Thr 


Arg 


Leu 


Ala 


Arg 


Lys 


Arg 


Tyr 


185 










190 






Asp 


Pro 


Lys 


Gly 


Asn 


Pro 


Tyr 


Tyr 










205 








Glu 


Phe 


Glu 


Pro 


Gly Thr Asp Ala 








220 










Val 


Ser 


Val 


Thr 


Pro 


He 


Asn 


He 






235 










240 


Glu 


Asn 


Leu 


Gin 


Arg 


Leu 


Leu 


Ser 




250 










255 





(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Met 


Glu 


Asn 


Leu 


Lys 


Lys 


Tyr 


Leu 


Glu 


Val 


Ala 


Lys 


He 


Ala 


Ala 


Leu 


1 








5 










10 










15 




Ala 


Gly 


Gly 


Gin 
20 


Val 


Leu 


Lys 


Glu 


Asn 
25 


Phe 


Gly 


Lys 


Val 


Lys 
30 


Lys 


Glu 


Asn 


lie 


Glu 
35 


Glu 


Lys 


Gly 


Glu 


Lys 
40 


Asp 


Phe 


Val 


Ser 


Tyr 
45 


Val 


Asp 


Lys 


Thr 


Ser 
50 


Glu 


Glu 


Arg 


He 


Lys 
55 


Glu 


Val 


lie 


Leu 


Lys 
60 


Phe 


Phe 


Pro 


Asp 


His 


Glu 


Val 


Val 


Gly 


Glu 


Glu 


Met 


Gly 


Ala 


Glu 


Gly 


Ser 


Gly 


Ser 


Glu 


65 










70 










75 










80 


Tyr 


Arg 


Trp 


Phe 


He 
85 


Asp 


Pro 


Leu 


Asp 


Gly 
90 


Thr 


Lys 


Asn 


Tyr 


He 
95 


Asn 


Gly 


Phe 


Pro 


He 


Phe 


Ala 


Val 


Ser 


Val 


Gly 


Leu 


Val 


Lys 


Gly 


Glu 


Glu 






100 










105 










110 






Pro 


He 


Val 
115 


Gly 


Ala 


Val 


Tyr 


Leu 
120 


Pro 


Tyr 


Phe 


Asp 


Lys 
125 


Leu 


Tyr 


Trp 



1 



Gly Ala 


Lvs 


Gly 


Leu 


Gly 


Ala 


Tyr 


Val 


Asn 


Gly Lys 


Arg 


He 


Lys 


Val 




130 










135 










140 










Lys 




Asn 


Glu 


Ser 


Leu 


Lys 


His 


Ala 


Gly 


Val 


Val 


Tyr 


Gly 


Phe 


Pro 


145 








150 










155 

_t- mj —J 










160 


Ser 


Arg 


Ser 


Aro 


Arg 
165 

J- W 


Asp 


lie 


Ser 


He 


Tyr 
170 


T iPi I 

■M L*L 


Aqti 


He 


Phe 


Lys 
175 


Asp 

*- 


Val 


Phe 


Tvr 


Glu 
180 


Val 


Gly 


Ser 


Met 


Arq 
185 


Arg 


Pro 


Glv 


Ala 


Ala 
190 


Ala 


Val 


Asp 


Leu 


Cys 


Met 


Val 


Ala 


Glu 


Gly 


He 


Phe 


ft ^ n 


Gl v 


Met 


Met 


Glu 


Phe 




195 










200 










205 








Glu 


Met 
210 


Lys 


Pro 


Trp 


Asp 


lie 
215 


Thr 


Ala 


Gly 


Leu 


Val 
220 


He 


Leu 


Lys 


Glu 


Ala Gly 


Gly 


val 


Tyr 


Thr 


Leu 


Val 


Gly 


Glu 


Pro 


Phe 


Gly Val 


Ser 


Asp 


225 






230 










235 










240 


lie 


lie 


Ala 


Gly 


Asn 


Lys 


Ala 


Leu 


His 


Asp 


Phe 


He 


Leu 


Gin 


Val 


Ala 








245 










250 










255 




Lys 


Lys 


Tyr 


Met 
260 


Glu 


Val 


Ala 


Val 



















(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TGCCCGAGCG TGTTGCCAAG ATGCTTGAAA GAATGCTATC CAAGGCGGAA TCTATGCTCG 6 0 

GCGACGCCCA GAGGCTTATC GAGGAGGGTA AGGCCGTTGA GGCTAAGAAG CTGTTAGCGG 12 0 

CTGCTCATAG GCTAGTAGAT CGCCTAGAGG ATGCTCTCGA CCACGCCCTC AACCATATAG 180 

AG CATC AC AA GGAACATCAT GAGGAGCACC ACAAGGAGCA CGACTAACAA CACTCTTAGA 24 0 

ATCTCGAGAC GAGCTTGCTT CCCGTGTCTC TCGCGCCTAG CCAGTTTTTA ATAGCCTAAG 3 00 

CCGAGACCCA CATTCCAACA TTACTCCGTT TGTCACTATC ATGTTCTAAT TGTCACACGC 360 

CCCGTATAAA TTGGGGGACC TGGAGGAAGC GTTGCCGGTG ACCCCGCGTG GCCAAGAAGG 420 

CTGTCTGCCC AATATGCGGT GGCGATGTTG AACTACCCGA TAACGTAATG GATGGCGAGA 480 

TCGTGGAGCA CG AC TGTGGG GCAATGCTAG TCGTGAGGAT CCGGGATGGC AATGTTGTTC 54 0 

TAGAGCAGTT GGAGCGCGTT GAGGAGGACT GGGG AG AG TA GAGGCTATGC GCATAGCAAT 600 

CGTTTATGAC CATCCGCGTG TTGAGGAGAA GAGGTTAGCT GAGGAAGCGA GGAAGCTTGG 660 

TCACGAACCT GTCCTCTTTA ATATTGACTC GTTGCTCTTT CGCCTTGATA GCCTGGAGCG 72 0 

CATTCTAGGC GATGTTGATG TAGTACTTCA GAGGGCGGTG AGTTACTTCA AGGCTCTCGA 780 

GTCTACAAGG ATACTCGAGG CTGCCGGCTA CACTGTCATC AACAATAGTT TAGTGCAGCT 84 0 

TAACTGCGGC GACAAACTAT TGACAACGAT CTTGCTTGCT AAGCATGGTG TGCCAACACC 900 

GCGTGCATAC GCTGCTTTTT CGCGTGACAC TGCTGTGCGG GCTGCAGAGG AGCTTGGATA 960 

CCCCGTTGTT GTCAAGCCCG TCATTGGTAG TTGGGGTAGG CTTGTGGCTA GGGCTGATTC 102 0 

CAGGGAGAGT CTAGAGGCTG TGATAGAGCA TAGAGAGGTT CTCGGCCCGG CTTACTACAA 108 0 

GGTTCATTAT GTGCAAGAGT ATGTGCGCAA GCCTCTACGT GACATACGCG TATTCGTGAT 114 0 

TGGTGATGAG GTTCCCGTGG CG AT AT AC AG GGTTAACGAG CGTCATTGGA AGACTAACAC 12 00 

GGCACTAGGC GCCAAGGCCG AGCCTGCGCC AGTGACCCCC GAGTTACGTG AGTTAGCGCT 1260 

TCGCGCGGCC AAGGCTGTGG GTGGCGGTGT GCTTGGTATA GATGTGTTTG AAGACCCGGA 132 0 

GAGAGGCCTC CTCGTGAACG AGATTAACGC GAACTCGGAC TTCAAGAACA CTGAGAGGGT 13 80 

GACCGGGTTT AACATGGCTA GGGCTATCGT CGAGTATGCA GTGTCGGTCG CG AAGAGGTG 144 0 

AATGGAATGG ATAGGGTAGA GGTGCTTCTG GATGAGGCTA GGCGTGGCGC TATAGAGGGT 1500 

GACGCTCGCC GCGCATGTGA AGCGGCATTA AGGCTGGTTG ACGTTGTGCT CCGCGAGGGG 1560 

CCTAGGGTTG CACAGGAGTC TGGGCGTGGG ATTGAACCCG GTGATGTACT ACTAGCTGAG 1620 

GCTCTGAGCT TGAGAGCAGA GCAGGTGAAG GAGGAGCCCA AGGCGGACAA TTGTCTGGAG 168 0 

CTCGCAAAGG CTGCATTCCG CCTCTATAAG CGGCTCCAGG GGATGGAGTA AAGTTCGCAG 174 0 

TGTGTTGCCC GTTTTAGCCT CTGCCTTACT TTCTACTCGC GTGAGGCGAG TGTCCCTTGA 180 0 

CACGTTGCTG GCGCGAGCTG AGAAACGACC TCGAGATGAT ACCCGAGATC GTCGAGAAGC 1860 

AGATCGAGGA GACGATAGTG CCGGAGGGTC TTGGCGAGCA ACGACTTGTG TTCATTGGCA 1920 

GCGGTGATTC TTTCGCGGCC GCACTTGTAG CCGAGCATGC CGGCATAGGC GTCGCACGCG 1980 

ATCCTCTTGA TGTGCTAGTG GCTGGCGTTG ATGGGCCTGG CGACGCTATA CTCCTAAGCG 204 0 



TTGGTGGGCG CTCAAAACGA GTTGTTGACG CGGCTCGTTT CCTGTCTTCA CGTGGCTTTC 2100 

GTATCATAGC GGTCACGGGT AACGAGAGGA GTCCTCTCGC ACGCACAGCA CACGTTACCG 2160 

TGAAGCTCGT CTATTCTGAC CTCGCCTGTG GCATGGGCGC CGCACGCCAT GTCGCTATGC 2220 

TTGCAGCGCT CTCCGCATTG TTCAACGCTA GACCTCGTAT ACCCGAGAAG CTTGTTGAGG 22 80 

AGCCCCTGCC TTTCGACCCT CAGGCTGTGT ACGCGGGTGT GGGCGTTGGT GTAGCCTCTG 234 0 

CCCTGTTCAT GGTGTTGAAG ATCTGCGAGT TGCTCGCAGA CTGCGCCACC TGGTGGCATC 24 0 0 

TAGAGCAGTT CGCACACGCA CCTGTCTATG GCACGAGAAG CAATATACTC GTCGTGTATC 2460 

CGATCCTCGT TGTGAGAGGA GCACGCTAGA GGAGTATCTC TCGGCCTTCC GGGAGGCCGG 2 520 

GTTTGAGGTC ACCACTGTAC CCGTGTTGAA CGACCCTTGG TCTACAGCTA TTCTCCACGC 2 580 

TACGCTGGCC ATCTCCAGTG CTGCAGAGAC CGCCTTCAGT CGCGGCATTG AGGAGCCGGG 264 0 

ATATCGTGCA CATCCCGCGC TTAGCAGGCT AACCAGGCTG ATCTACCTAG AGGAGTAGAA 2 700 

CCTCTCGAGG ACCGGTATGT AGTGGTCTAG AGGCTTCCCG TCATGGTGTA TCGCGAGGCC 2 760 

TATTCCTGCT CTCCTCGCGC CTTCCACGTT GGGCTCATAA TCATCTATGA ATGCTGTTTT 2 82 0 

CGCTGGGTCC GCGCGAAGGA GTTGCATCGC CGCCTCGTAT ATCTTTGTGT GTGGCTTGCA 2 880 

AAAGCCGACA ATATCCCTCG TAACCACCGT ATCCACGAGG TGGGCTAGAT CGTCACGCTC 2 94 0 

TAGAAGTAGA CGTACGCATT CGTAGCACCA GTTGTTCGAG ACTATGCCGA CCAGTATCCC 3 000 

GTTTCTCTTG GCCCATCTTA GCAGCTCGTA TGTACCCGGT GCTACGTATA CGCCAGACAG 3060 

CACAGCTGAT TGCAATACCC TTGCTAATGC CTCTGCCCTT GAGGGGGTCG GCGTCAAGCC 312 0 

GTGTTTTGCG AGGAGCACGG CAGCCGCATA CACTATACTT TGTTGCACGG AG AC AT CC AG 3180 

CCTCCACGTG TCCATTACAC GCCTCACGCT ATC CGGCGTC GCGTCGGCCC CTAGGGCACG 3 24 0 

TAGATGTCTG GCAGCAGTCT CGTAGAGAGT CTCCTCGTAC CACTCATTTG TGAGGTAAAT 3300 

GACGCCACCT AAATCCAGCA GGAGTGTAGG GTTACGCGGC AAGGCGCCTC CTCATGTATT 3360 

CGAGGAGGCC GCCCGTTGCC AGAATTTCAG CTACAACACC CCGGAAGGGC GGGAAACGGT 3420 

ACGTCAACAC CCTACCATCC TTCTTGATGA GCTTCGCTAC ACCCTCGTCA AGGTTTATCT 34 8 0 

CTATCTCGTC GCCCTCCTCG GCCGCCTCCA CGAGCTCTGG GAG C ACT AT A ACGGGGAGCC 3 54 0 

CGTTGTTAAT CGCGTTACGG TAGAATATTC TCGAGAAGCT CTTCGCTATG ATGGCCTTGA 3600 

CGCCTGCAGC CTTGAGAGCT ATCGCGGCTT GCTCCCTGCT ACTACCCATA CCAAAGTTCC 3660 

TACCCGCGAC CAGCACTACA CCCTTGGACG CCTTCTTGGG GAACTCCGGA TCCAGAGGCT 3720 

CCATAGCATG CTCGGCAAGC TTCTCCGGCT CAGTATATAC CAGGTAGCGG GCAGGGATAA 3 780 

TCACGTCGGT GTTGATGTTA TTGCCGTAAT TGAGCACAGG GCCCTTCACG ACACCCAGGT 3 84 0 

TCAAGAGAGG TTCACCACAA GTTTGGCCTC GCTATCCCAG GCTATAATCC AGCTGTTTAC 3 900 

TCGGCCAGCT TCACCCACAC ACTTTTCAAC TCCATTATCC TTGTAGCGCA ATCTACCCTT 3 960 

CTGGGTAGCA CAGCGTTAAG CCCATAGTGC CAAGGCGCCA CAATGATGCC CTCCGGCACA 402 0 

TTCTCGTCGG GTATCAGCCG GAGGCGTATG GCCCCTCTCT CCGTCTCGAG CCTAGCGTGA 40 8 0 

CCGGCGCCAG CCTCCTTAGG GTTGACTCGT GCGTATAGCT CGCCGCTCAC ATC TAG CATC 414 0 

GCGTTTGTAC AGTAGCTCAC CGGGTCTCTT GCAGTCACGA GCACCTTCCT ATCACCATCG 4200 

GGCACGACCG GCTCGACCGG CGGGTATAGA CGGACGCGTA TCCTCGAGAC ACGCCTGGGC 42 60 

AGGAGGTACT CGCCTCTCTC CGCAACCGCC TTGGAGGAA 42 99 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TGGACTGATA AAGAAAAAGA AGAGGTTTAA GGGCCTCAAT ATTAAATTCT ACACATTAGA 60 

TATCCAAAAT GGAGAATTAC TTAATCTAGA GACTTACCTT AAGGAGTTAC ATGAGTTCCT 12 0 

TAGAGGCCTT ACATTAAAAC GAAAAGTAGA AGAGGAACAA TGACCCCCGA AGAGCTCCTA 180 

ACCCGCCTCG AATTCAAAGG AGTAACCCTC GAAAAGATGC TCAATACTGC GTTAGAGCTC 24 0 

TACATCGGCG ACGAGCGCGA GAAAGTTCGA GAAAGGCTGA GAGAGCTGAT GCTGAGGTAT 300 

CTGGGCG AC A TCAACGTTCA AGCTCTGCTC TTTTCGGCTC TACTGCTCGA AGAGAACTTC 360 

AAGGTTGAGG GCGACCCCGT GAACCTTGTG GCCGACGAGC TCATCGGCAT GAACATCGCC 420 

GAG CT CAT AG GTGGAAAGAT GGCGCTCTTC AACTTCTTCT ACTACGACAC CAAGAAGCCC 4 80 

GGCATTTTAG CCGAGCTTCC GCCTTTCCTC GACGATGCGA TAGGGGGCTT TATAGCGGGC 540 

TGTATGACAA GGCTGTTCGA GGGGGTGTAC GGTGCGGAAT CTCTTACCCT TCTTCACGCG 600 

GATTCCGGTC AAAGGCAACT TCAAAAGGGT TAGAAATGAG CTCTGGGCAC TTCCCATTCT 660 

CGCACCGGTA ACTTCGGCCC TGGCGACGCT CGTGGGCTCT GTGCTCGCCG GGGTAATAAT 720 

CCTGGGCGGC AACTACGCGT TTCACCCAAC GTCTCGGCAA CCCACGTGCT GATAACCCTC 7 80 

ATAGGCTTCG TCGTGGTCTA CAGCATACTG TTCTACATCT GGCTCCACTT CGTCAGGAAG 84 0 



CTCATCAGGG AGGGCCCCGA ACCGGTTGAG GGTGACGTCA CCGCGAAGCC GACCCCTGCC 900 

GTTAGCGCCG CGGGAGGTGG TCAGTGATGG ACTACGCGAC CGCATGGTTT TACTTCTCCG 960 

CCTTCCTCCT CGGAATGTAC TTAGCGTTTG ATGGCTTCGA CCTTGGCATA GGCGCGTTGC 102 0 

TCGCCCTGAT TAAGGACCAG AGGGAGCGCG ACATACTCGT GAACACCATC GCGCCGGTCT 108 0 

GGGACGGCAA CGAGGTCTGG TTCATCACCT GGGGTGCCGG GCTCTTCGCG ATGTGGCCGG 114 0 

CGCTCTACGC GACGCTCTTC AGCACGTTCT ACCTTGCCGT CTGGCTGCTC GCGTTCCTGT 1200 

TCATATTCAG GGCTGTCGGC TTTGAGTTCA GGAACAAGAA CAAGGAGCTA TGGGACAAGC 1260 

TCTTCGCTCT CGTCAGCGCG TTAATCCCGC TCGTCATCGG CGTCATAGTC GGCAACCTCA 1320 

TCATGGGAAT TCCCATTGAC GCCAAGGGCT TCCACGGCTC ACTGCTGACG CTCTTCAGGC 13 80 

CCTACCCGCT CATCGTCGGC CTCTTCATAC TCTTCGCGGT GACCTGGCAC GGAGCCAACT 144 0 

GGGGCGTCTA CAAAACCACA GGAAAGCTCC AGGAGCAGAT GAGGGAGCTC GCCTTCAAGG 1500 

CCTGGCTCCT GACCGTCGTC TTCCTCCTGC TCACAGTCAT CGGCATGAAA ATCTGGGCCC 1560 

CACTGAGGTT CGAGAGGGCA CTAACGCCGC TTGGGCTCCT CCTAACGGTT GTCATCCTCG 162 0 

TGGCAGGACT GCTCGACGGA CAGCTCATCA AGAAAGGGGA GGAGAATTTG GCCTTCTACA 1680 

TCAGCTGGCT GGCCTTCCCG CTCGTTGTGT TCCTCGTCTA CTACACAATG TACCCCTACT 174 0 

GGGTCATCTC GACCACCGAT CCGAACTTCA AGCTCAGCAT ACACGACCTC GCGGCATCTC 1800 

CGCTGACCCT CAAGGCCGTC TTGGGAATCT CGCTGATCCT GGCGGTCATC ATCATGGCCT 1860 

ACACCCTCTA CGTATACAGG GCCTTCGGCG GAAAGGTCAC CGAGGCGGAG GGCTACTACT 192 0 

GAGTTCCCCT TTCCTTTTTC GATATTCGAA CTTTTTTAGG GAAAAGTTTA TAATTCGAGT 1980 

CACCTAAGTT CCTTCTGGAA ACCTAAAAAA CGGTGGTCGA AATGCACAGA GGCAGATCTA 2 04 0 

CCGGCTGGCC CTACGACCGG AAGCCGGTCC TCGTCTTCTG GGAAACCACC AAAGCCTGCC 2100 

GGCTCAAGTG CAAGCACTGC AGAGCGGAGG CAATACTCCA GGCACTGCCG GGCGAGCTGA 2160 

ACACGGAGGA GGGAAAGGCC CTCATCGATT CCCTCACCGA CTTCGGAAGG CCCTACCCGA 2220 

TACTCATTCT CACCGGTGGC GACCCGCTCA TGAGGAAGGA CATCTTCGAG CTCATCGAGT 22 80 

ACGCCGTTGA GAAGGGCATT CGCGTTGGTC TCGCCCCCGC TGTAACGCCC CTCCTGACCG 2340 

AGGAAACAAT CGAGAGAATC GCGAGGAGCG GAGTTAAGGC GGTAAGCATA AGCCTCGACA 24 00 

GCCCGTTTCC AGAAGTTCAC GACGCAATCA GAGGCATAGA AGGGACGTGG GAGAAAACCG 24 60 

TCTGGGCCAT CAAGGAGTTC CTGAAACACG GCCTAAGCGT TCAGGTGAAC ACGGTTGTGA 2 520 

TGCGCGAGAC CGTTGAAGGA CTGCCCGAGA TGGTGAAACT GCTTAAAGAC CTCGGCGTCG 2580 

AAATCTGGGA GGTCTTCTAC CTCGTCCCGA CCGGGAGGGG CAACTTCGAG AGCGACCTGA 264 0 

GGCCGGAGGA GTGGGAGGAC GTCACACACT TCCTCTACGA GGCCTCGAAG CACCTCCTCG 2 7 00 

TGAGGACCAC CGAGGGCCCG ATGTTCAGGC GAGTGGCGAT AATGAGGAAA GCCCTTGAGG 2 760 

AGAAGGGATT CGACCCCGAC GAGGTTCTCA AGCCCGGGGA GCTCTACTTC CGGCTGAAGA 2 820 

AACGGCTCGT TGAGCTTCTC GGCGAGGGGA ACGAGGCGAG GGCCCAAACT ATGGGAACGC 2 880 

GCGACGGGAA GGGAATAGTC TTCATCGCCT ACAACGGCAA CGTCTACCCG AGCGGTTTCC 2 94 0 

TGCCCTTCAG CGTCGGCAAC GTCCGCGAGA AAAGTTTGGT TGAGATTTAC AGGGAGAGTG 3 000 

AACTTATGAA AAAGCTCCGC TCGGCCGAGT TCGAGGGGCG CTGCGGGAGG TGCGAGTTCA 3 0 60 

GGGAAATCTG CGGGGGAAGC AGGGCGAGGG CCTACGCCTA TCGCTTAAAC CCGCTCGCCG 312 0 

AAGACCCTGC CTGCCCGTAC GAGCCGGGCT CATACCTAAG GCTCGCCAAA AAGTTCAATC 3180 

TTCACCTTCC GATTGAGATT TTTGGAGCCC AAAAGCCGAT TTGAGGTGAT GGAAATGAGG 3240 

TGGAAGGCTG TTT TACT GAT TGGAATCCTC CTCGTGTCTG TCCTCGGTGC CGGATGCGTT 3 300 

GGCTCGAATA CCTCAACTGA AACCGGCCCA TCCCAGAAGG AAATAACCGT GAAGGACTTC 3 360 

TCGGGAAGGA ACATCACGGC TAAAGTTCCG GTTCAGCGGG CGGTCGTTCT CTCGACTTCC 342 0 

GCCCTCGAAA TAATCCAGCT CCTCAACGCG AGCGACCAGG TCGTCGGTAT TCCAAAGGAG 34 80 

GCCCAGTACG ACGCTTTACT GAGCGAAAGC CTGAAGAACA AGACCGTCGT TGGCGCGAGG 3 54 0 

CTCAAGATTG ACGACTGGGA GAAGGTTTTA GCCCTAAAGC CCGACCTAAT CATCGACCTC 3600 

GACCTGAAGA AGTTCTACAA CGTTGACGAG CTCCTCAACC GCTCCGCCAG CTACGGAATT 3 660 

CCGGTCGTCC TGCTGAGGGA GGATAACCTT GAGGACATAC CGAAGGCGGT TTCGCTCCTC 3720 

GGTCAGCTCT TCGGAAGGGA GAAAGAGGCC AAGGCCTTCG ACGACTACTT CAACGAGCAG 3780 

GTGAAGGAGG TTAAGGCCAT AGCCTCAAAG ATTCCAGCGG AGGAGAGAAA GAAGGCGATA 3 84 0 

ATGATACAGC CGATAATGGG CAAGCTCTAC CTCGTCAACG GCAACGACGT CCTTGCTCAG 3900 

GCCGTCAGGC TCGTTGGGGC GGACTACCTC GTGAACCTGA CCTTCAACGG CTACACTCCG 3 960 

GTTAGGGTCC CGATGGACGG GGAGAAGATA AT AG CG AACT ACCGCGATGC AGACGTCGTA 4 02 0 

ATCCTCCTGA CGAGCGCCGT AACGCCTTAC GACCAGGTCG AGAAGCTCCG GGAGGAGATG 4 08 0 

CTCAGCGACG AGGCCTGGAG GGG CATTAAG GCCGTCAGGG AGGGCAACGT AGTAATCCTC 414 0 

AGGGCGGACA TGGGTAAAGA CTCCTTCCTC CGCTGGAGCC CGCGCTTGGC AGTGGGAATC 4 200 

TGGGTCATTG GAAAGGCAAT CTACCCGGAC TACTATCCTG ACTGGAACGA CAAGGCCAAG 4 260 

GACTTTCTGA AGAGGTTTTA CGGCCTCTCC TGATTTTTCT TTTGGGGTGG GACGATGATA 4320 

GCGGTCTTTC CAGCGAGTCT CGCGGAAATC GTCAAACTCG TCGGGAAAGC CGGGGAGATA 43 80 

GCCGGAGTGA ACGAGGAAAT CAGGTTCGAC CCCTGCCTGC CGGAGCTGAA GGATAAGCCT 444 0 

GTCATCGGAA AGTACCTCAA GCGGAGCAAG AGGACCTACT GGGACGTTTT AGAGGAGCTT 45 00 

AGGCCGGACC TTATCCTCGA CTTCGATGTT GAGAACCTGC ACTCCGGGGA CGAGCTGAGG 4 560 

GCCTTTGGGG AGCGTATAGG GGCAAGGGTC GAGCTGATTG ACTTCGAGAC CGTTGAAGGC 4 62 0 

TTCGTCGAGG CGAGCAGGAG GATAGCCGAG CT7AACGAGGG GCGACTTTTC AAAGCTCGGC 4 680 

GGGTTCTATG AGAAGCACCT GACGAGGCTG GGTGAGATAA CTGAAGCCAT CGAGGAGAGG 4 74 0 



CCTAAAGCCC TGCTCACCTA CCGGAACTTC AACGTCGTAA CGAGGACCAA CGTTCTGAGC 4 800 

GACGCGGTTA GAAAAGCAGG GGCGATGAAC CTCGGCGAGA GGATACGGAC AAAGCGGAAG 4 860 

GTCTATCCGG TAAAGAAGGA GCGCTTCTTC AGGTCCTTCG GCGATGCGGA GCACCTCTTC 4 92 0 

CTGCTCACGA GCATAATGAC GGACAGGGAG AAAATGGAGG GGATAAGGGA TGAAATCCTT 4 980 

GACTCGGCCG AGTGGAGGGC AATGGAAGCC GTTCAGCTCG GAAACGTGCA CATAGTTGGC 5040 

TCGGCCCTCG ACCTTGAGAG CTTCATGCGC TGGAGTCCCC GCATAATCCC GGGAATCTAC 5100 

CAGCTTGGAA GGTTTATACA CGGAACAAAT CACCCACGAA TCTCGTGGAA ATCACTGCAA 5160 

AAGTTTAAAA TCCCCCTCCC ACCCCTCGAA GAACAAAAAC GCATCGTCGC CTACCTCGAC 5220 

TCGATACACG AGCGCGCCCA AAAGCTGGTA AAGCTCTACG AGGAGCGGGA GAAGGAGCTT 5280 

GAGAAGCTTT TCCCCGCGGT GCTTGATAGG GCGTTTAGGG GTGAGCTGTG ATTCCGGGAA 534 0 

TGGAATACGG CTTTGAGAGG GCAATCTTTG AGATAGTCAG CGGCTTTGTT CTCTCCCTCG 54 00 

TAGTCAGGGC TTTCGCTTAC AGTTTTGGTC TTCCATGGGT ATCCTTTTTG TTCAACGTTC 54 60 

TTTCGATACT TCTGACAATA GGCCTGATTG ACAAAATGCC CTTCTGGTCC ATGTCATATC 552 0 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39; 

AGCTTGGATA TCGAATTCCT TATATGAAAA ATT CATC GAA TTGGTAAAAA ACCACGATCT 60 

TCATGTGGAA ACTGGAATAT TTGCTGCGCA TATGCTTGTG GAAATACATA ACGATGGTCC 12 0 

GGTGACTTTG TTACTTGATT CAAGAAAAGG TATTTTGAAG TCATCTTTGC TGTCTCTAGG 180 

AGGACTATAT GCCTGAATAC TCGCATAGCA ATAAAAACAA CTTTTTTGCC GAAAACGATG 240 

TGAAGAATTG TCATCTACTG CATGTATGTT GTGCACCCGA TTTGGCAATT TCTTATTTGT 3 00 

CCGGTGCACG TGGTGATATT TTCTTTTACA ATCCTAACAT ACATCCAAAA GCTGAATACG 360 

AGAAACGACA CGCCGAAGTG ATTAAAATTG CTGCACTCTT TAAAATGAAT GTTCTGAAAG 4 20 

TTCCTTATAA TCCTGACCTG TTCTTCAAGC TTACTAAAGG ATTAAAAAAT GAACCTGAAG 4 80 

GCGGGACAAG GTGCGAGATT TGTATAAGAA TGCGACTAGA AAAAACAATG GAATACGCGA 54 0 

AAGAAAATGG CTACAAGAGT GTTTCCACAA CGCTAACAGC CTCTCCAAAG AAAAATGTAG 600 

CGATGATTGT GAAGATAGGA AAAGAACTGG AAAAAAAATA CGGTGTGGAA TTTTTGCCTA 660 

ATGTGTACCG CAAAAGTCCG CTTTACAACG ATGCGCAAAA GCTTATAACG AAAATGGGTT 720 

ATTTACAGAC AAAACTACTG TGGTTGTATT TTCTCAATAA GAACTTCCGT TATAGTAGCC 7 80 

ACTCAAGAAA CTAAAACCGT AAAAAGTGGG GTCGAAGTAT GAAAATATAC CACAAATTAG 84 0 

AAGAAGTTGA AGAACATAAG CGGTCGTATG CATCAATTGC TTTTTCATCG AAAGTCAGGG 90 0 

TTGAATATGA ACATGCTGGC GAAAAACTTG CCCTCATCCC TGTAACTATT GGAGACCTTA 960 

CGGTGGTTAT CGAAATTGAC GATGATAGAG AAGTATTCAA TACTTTGTTG AACGAGCACA 102 0 

TCAAAAACTC TATCCTGAAA CAGTTTCCGT ATCCGGAAGA GATTAGAGGG TTAGCCAGAC 1080 

ATTTTCGCAC AGAATTGAAG AATTTCAGAA TCTTGGTTGT AAAATACAAT AGTGTCGAAG 114 0 

AAAAGGAATT CTCAAGGTAT TCACTGTCTA ATATAACATT CGGTGTGGTG TCATACAATA 12 00 

AATTTGATGT CCATTTGTTA CCAAGTAATG TAAAAGTCAG ACCGAAGCCA GGATACTGTC 1260 

TTTCACATGT TGTCCAAAAG CCTGAAGAAG GTATCAGGCA AGCATTCTTG TTAGCCCGGT 132 0 

GGTTTGGTGG TGGAAGCTAC GACCAACTGC CCAAATTAGC GCTTGAAAGC ACTGACATTG 13 80 

ACCTTGGAAA GTGGACAAAT ATAGTCAAAT ACATCGTTCT GTCAGATTTT GAAAAGAGGT 144 0 

ATTTTTCTGG TATAATAAAA AAGCTAAACG AATTTAGAAG C GAGA CAT AT TTTGACCCAT 1500 

TTGCTAGGCT TGAAATGATA TCACTTGGCA TAATACTCGC CAAGTCAGAG GGAGGAGGTA 1560 

ACTTTGAACC AG AC AG TT AC GATATCATTT AGAGCACTTA CTGAAAATAT AAAATTAGCA 162 0 

CGAGTTGTTA TACATACTTT TCTAACATTC CGAGGAGTGT TCGATAAAGA TATATTCGAT 1680 

ACGGAATTGG CTGTAAACGA AGCGATTGCA AACATTATTC AGCATACATA CAAAGGTGAA 174 0 

CCAAACTACG TTGTGATGAC GCTCAATTGG ATAGAACCAG ATACACTCGA AGTGTTACTC 1800 

CGCGATTTTG GTCCAAAAGT GGACCCAACG AAAATCAAAC CACGAGATTT AGATGATATC 1860 

AGACCAGGAG GACTCGGAGT TTATATAATT CAACGCATCT TCGACATTAT GG AATTCCGA 192 0 

AACGTGAGTC ATGGAAATTT ACTTTATCTA AAACGCTCCT TCTTAATACC TCCTAAAAAG 1980 

CAGGAGCTTG GGAATTTAAA TAATGAACCC TATCGAGAAT ATTGAAAAAA CCGTCAAAAC 2 04 0 

GGGGGAAAGA AGACAAATGG GCTTGCTCAC AGGTTTGACA AAAAATCCAT CTTTCATGTC 2100 

TGCATTTTTT GGCTTTTTGG CAGCACAATT TTTGAAAGTG GT GAT AT AC A AAGATTTCCG 2160 

CGTATTTGGT AGATACGGTG GTATGCCCAG TGCTCATGTT GCAACAACCT CAGCATTAGC 2220 

TTGGGCTGTT GGTTACACTA CAGGTTTTGA TTCACCGCTT ACAGCCATCG CTGCAATTTT 22 80 

CCTTGCTATT ACAACAGCTG ATGCTGTTGG TTTACGAAGA AATGTCGACC CCAATAAAGG 2 340 



ACATACACTA ATGGAAGCTA TCTATGGCTT CTTACTTGGG TGGATAGTCG CTCTGCTTAC 24 00 

GGTTAAGTTG TATCGATAAT TTTGAATGAG TTGTAGTGAA ATAGCCCAAG TCTTTTTTCG 24 60 

CAATTACATC ATAATGCCAG GAGGGTAATT TACAATGTTT TTTAGATTAC CATTTAAAGT 2 520 

TTTTGTTTTT GCAGTTTTGT TGCTTGCCAT CTCGTTAACA AGTGTTGTTA GTTTTGGACA 2 580 

AGATGATGAG CAGATAAAAA CACCAAATTG GTTTAGAAGT GCGGTGATTA AG AAAAG AG C 2 64 0 

TGGTATGAAT CTAAAGACCG CCCCAGAGTT TGTAGATGAC CTATGGAATG CGATATACAC 2 700 

TATAGGCACA AAATACAACG TTCCCCCAAC GCTTATAGCC GCTGTCATTT CTGTAGAAAG 2760 

CAACTTCGCC AACGTGAAAG GTGCTGGAGA CGTGGTAGGA ATGATGCAAA TTTCTATCTC 2 820 

CACAGCCAAA AATATATCGA AACTCCTCGG CCTCGAACAA CCAAAAAACG GTTGGGATGA 2 880 

GCTCCTCACA AATTATTGGT TGAATATAAC TTACGGTACC G C AT AC AT CG CTTATCTTTA 2 94 0 

CAAAAAGCAT GGAACTTTAC AGAAAGCGCT CGAAGAATAC AACAACGGAA AAAATAAAAC 3 000 

TAAATACGCC CAGCTGATAC TACAACAATA CAACCTATAC GAGAGCCTCC ATTCTGCTGA 3 060 

AATAAGAAAT AACCAGCAAT TGGATACAGA TAATTCTTCG ACATCTTCTG AAGCAACAGA 3120 

TACTTTGAAT ACAACCAGTG CAACAAATTC ACAACCAACA TCAGATGCAT CAAATACATC 3180 

AGTTAACACT TCAGAAATCA AGTTCCCGCC TCTTTTCGGA GTTGCAGGTT ATTAAGATAT 3 24 0 

TTGTTCGGTA GTTACTTAGG AATGTGGGGT GTATAGTTTG GAAGATGAAA AAATGAAACC 3 300 

TGAAACGATA GTAAAAATTG AACATTTATC TTTTTCTTAC CCGAGTTTCA GTCTCAAAGA 33 60 

TGTAAGTTTT GAGGTTCGGA AGGGAAGTTT CTTCGGCATT ATTGGACCAA ATGGTTCGGG 3420 

AAAAACCACG CTACTCTCAC TCATTATGAA ATT CC AAAAG CCAAAAAGTG GGAAAATAAC 34 80 

AGTTGATGGG AACGATGTGC TCAGGCTATC TCACAAAAAA CTTGCACAAC TT AT AG CAT A 3 54 0 

CATCGCTCAA GACTTTAACC CTACATACGA TTTCACAGTT GAAGAATTGG TCGAAATGGG 3 600 

AGGAATCCCC CGCTCACCAC ATTTTTTCGA AACACCTGTT TACGAGGAAG AATTAGAAAA 3 660 

TGCACTCAAA ACTGTTGATT TGCTTGAATA CCGAAAAAGA ATATTCTCCA CTCTTAGTGG 3 72 0 

AGGACAACAG CGCAGGGTCT TGATTGCACG CGCAATCTAT CAAAACACAC CTATCATCAT 3 780 

TGCTGATGAA TTGGTTAATC ACTTGGATTT AGGGCAAGCA ATTAAAGTGT TAGATTATCT 3 84 0 

AAAACAACTT ACCGAATGTG GAAAGACGAT AATTGGACAT TCCACCTGCA GCCCGG 3 896 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

ATGTGCTGCA AGGCGATTAA GTTGGTAACG CCAGGTTTTC CCAGTCACGA CGTTGTAAAA 60 

CGACGGCCAG TGAATTGTAA TACGACTCAC TATAGGGCGA ATTGGGTACC GGGCCCCCCC 12 0 

TCGAGGTCGA CGGTATCGAT AAGCTTGATA TCGAATTCCG TACGAAATGC GGGAAAGAGA 180 

GAAGGAAAAG GAAAGAGAGC ACAGATTTGG AAATGAGACA GAACACGAGG AAGAGCATGG 24 0 

TATGGCAGAG CGTGAAAGAG CACATGAGAA CGAGTCTGAA GAAATGGGCA AGGGCGTTGG 3 00 

CATGGGCGCC CATGGAATGA AGATGGGCAA AGAAGCTCGC GAAATGGTGA AGGAAGAATA 3 60 

CAAGGAAGCA AAGGAGAGAT ACAAGAAGGC TAGAGAAGAG TTTGAAAGAG CAAAGAAGAT 4 20 

GGGATTGGAC ATCAGAGAGG AGCGCGGATT CAAGATGGCC AAGGGATTCA TGGTAGCTGG 480 

ACTAGACGTT GCTGAGATGT GGCTGGAGAG ACTGAAGGTA CAGGTCATGA ATATGGGTGA 540 

AGAGGCCAAG ATCACAGAGG AGACCAAACT GGAGCTGCTC GCAAAGATCG ACGAGAAGCT 600 

TGCAGAAATC AAAGAGCTGA AGAACGAAAT CAATGAGACC TCCTCACCTG AAGAGCTGAT 660 

AGAAACTGTC AAGAAAATCA GAAAGGAGTG GAGAGAAATC AGAGATGAAA TGAGGGCTCT 72 0 

TACTGGCTAT GTCGCCGTTG CCAAGGTGGA AAAGCTTGTT GAAAAGGCCA AGCAGGTAGA 780 

GCTAATGCTT GAGGCAAAGA TCGAGGAGCT CGATGCTGCA GGAGTTGATA CAACCAAACT 84 0 

CGAGGCAACA CTCGAGGACT TCTCGGCAAA GGTTAATGAA GCAGAAGATT TGATTGACAA 90 0 

GGCTGAAAAT CTGTTCGAGG AAGGCAACAT TGCTGAAGGA CACATGACTC TCAAGGAAGC 960 

CATAAAGACT CTCAAGGAAG CCTTCAAGGA TGTCAAGGAA GTTGTCAGCG AGATGAAGGA 102 0 

AATGAACCAG TATAGAGTTA GGGAGGGCAA GATCTTCTAC GGAAACGAGA CTGGAGAAGT 108 0 

CTGGGTGGAT GGTAATGGTA CTGCTGAGTT TAACGGTACC GGTATCGTTG TGATCAGAGG 114 0 

AAACGCAACA CTTGAGGTCG CACCAGAAGA TGCGATCGTG ACACTGGTCG GCTTCGGCGT 12 00 

GAAGAGCGTT GAGGGTGGCG TTTCAAGAGT CAGCGGAGAA GGTAAGGCAG TAATCAGAGG 12 60 

AGAAAACCTC ACCGTCAAGG TGGAAGGTGA CGACTTCAAG CTCATAGTGA AGGGCTACGG 1320 

TACACTCAAA CTCGATGGTG AGGGTGAATA CAGGG TAAAG AAGAGCCCAC AGGAAGAGAT 13 80 

GACATTTAAA CTCTTTCTTC AACTCTAGCA GTTTGAGCAT TGCATTTCCA AGATTTTTGC 144 0 

TGTTAGCTTC GGGACAACTT TGAAAATACG TCGAGACAGG CTCAAATGTT GTCCCAGCAT 1500 

TGCAGCTTTC GGCAAAGCGA ACGAGATTTG CGTTCCGCTC CCCAGCCCAA CATGGCTTCT 1560 



GTAATCTGAA 
AACACTTCTA 
TCTTCCCACA 
TCCAACCTGG 
AACATACATC 
AGTTCAAAAT 
GATTTTTCTG 
AGGCAGATGA 
TGCATATGAT 
ATCAGGCAGG 
TTTTGCAGCC 
CCTGAGTTCT 
AGGATGCACA 
ATCCATTGGT 
CAATTCCATG 
CTTTTCTTCA 
ATTAAATGGT 
AAACCATTTA 
TGCCACACAT 
TTGATCTGAT 
TATACAGGCT 
AAGAGTCAGC 
GTTTAGCATC 
TTTCTTGATT 
GCCATACGAA 
CCAAACCCAG 
ACAACAGTCC 
CACAGATGAA 



AAAACTTCAA 
AATCTTTCGA 
ACCTCGATAT 
CCGTTCAAAT 
TATAGAATTT 
TCATTCTTTT 
AGATAGTTCT 
TAACCGGTGA 
ATCGGATCGT 
AAAGTAGTAA 
ATTGAAATGT 
TTATAATCAT 
ACATCATAAT 
GTAACTACAT 
AGCATTCTGC 
TCTGCCATCC 
ATTAATTTTT 
AGGGCATCGA 
TGAGGACCCC 
AGAAGAGGAG 
TGCTACAAAA 
ATCTCTTATG 
AAAAATGCTC 
TTTGAACTGT 
AATTCCAATG 
ATTAAACTCA 
TATAATTGAC 
AGCTTTCCGG 



GTTCAACAGC 
GATTTATTGC 
GCGGCTCTTC 
CAATGAGCGT 
AAACGGTAAT 
AACTGCACTA 
GGTATCTCGT 
ATATCCTTAT 
GATCGATGTG 
TTATATCGGG 
GCTTATGAGC 
GCGGGAAGGG 
CGTTTGCCTC 
CTCCAATATA 
TTCCGATGAC 
CATACCAGGA 
TGCCGTATTT 
TGAAACATCC 
AAAATGGTGT 
CCCGAAGTTG 
CTTATAAAGT 
CTCATAAAGG 
CAGGATATGA 
ATTTTCTACA 
CCCAAATCCT 
ATTTCATCAA 
CAAACTTGAT 
ATTCCTGCAG 



TTTCCCAAAA 
ATTTCCTTTC 
CAGAGCAATA 
GATATGAATT 
TACTTAAGAA 
CAGCTCATCT 
ATCAACTATG 
TATCTCTCCA 
AGGATACTCC 
AATAAATGGA 
AACAACAACA 
ATAAGAGATT 
AAGTGGCTTT 
CCGAATGCAA 
AGCGACACTA 
AATTTTTCTC 
TGAGGAGGTA 
CAAAACCAGT 
GAGAAATGGA 
AGGAGGACGA 
TACTCGAAGA 
AAATTATCGG 
GTCTCGGGTT 
TGCTCTTTTC 
GGTAAATGTA 
CAGGAAAAAG 
AGATTACAAA 
CC 



ACATCCAAAA 
ACCGAAATGC 
CCCACTCCAC 
CTCGACGGAG 
GTTTTGGTTT 
GTGCCTTTTC 
TAAGCCTCGG 
GCCTGAACCG 
ACCTCGAAGA 
GCTCCGAGCT 
TCAATACCTT 
ATACACGAAT 
ATGCTGGCAT 
CCAACACCAC 
AAGTTCCTGA 
ATGGCAATAG 
GATATTAACC 
TCAGCAAAAA 
CGAACTGGGA 
CG AG AT TAAG 
TCTCAAAAGC 
TGAAGACAGA 
CGAAGAGGAC 
CCAACCACAT 
CTTTTTCATA 
AAAGAACGAA 
CACCACAGTT 



GCTTTTCCGC 162 0 

TATCGGATTC 16 8 0 

CGTCAATCCT 174 0 

TTTTAACCTT 180 0 

TGCGAAAAAG 1860 

TCCTTAATTC 192 0 

GAGCTATTAC 1980 

AGCATGTCAG 2 04 0 

AAGACACACC 210 0 

CTTCAGCAAC 2160 

TCAACTGTCT 22 20 

CAGAACTCAT 22 80 

CAAGCCTCAC 234 0 

TTCTCCAGAG 24 00 

GATAATCTAT 2 4 60 

CCCCGCATCC 252 0 

AATTATTTTC 2 580 

ATTAAATCAC 2 64 0 

GGAGTTATTT 2 7 0 0 

CTCGCAGAGA 2 760 

CATGAGCTTA 282 0 

GTTCTGGTTG 2 880 

GAAAAGTACG 2 94 0 

TCAGTTGCAT 3 000 

GTAAATGCTG 3 060 

AAAAAGACCT 312 0 

GGAATCAAAG 318 0 

3212 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION; 1 . . . 879 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GAA ATA ATA AAC AAA TTT CTA AAA AAA ATT GGA TAT AAG AAA GAT 
Met Glu lie lie Asn Lys Phe Leu Lys Lys lie Gly Tyr Lys Lys Asp 
15 10 15 

GGA GAA GAA AAA AAG GAC AAA TCT AAA ACC AAA ATA AAA ATT GAA GAA 
Gly Glu Glu Lys Lys Asp Lys Ser Lys Thr Lys lie Lys He Glu Glu 

20 25 30 

GAA AAA ACC ATG GAT ATC GAA ATT CCA AAA ATT GAA CCT ACT GAA AAT 
Glu Lys Thr Met Asp He Glu He Pro Lys He Glu Pro Thr Glu Asn 
35 40 45 

TTT AAT CGT GAT GAA ATT GTT TTT GAG GAA GAT AAT GCC TAC GGT ATA 
Phe Asn Arg Asp Glu He Val Phe Glu Glu Asp Asn Ala Tyr Gly He 
50 55 60 

TCC CAC AAA GGA AAT AGA ACA AAC AAC GAA GAC AAT ATT TTA ATT AGA 
Ser His Lys Gly Asn Arg Thr Asn Asn Glu Asp Asn He Leu lie Arg 
65 70 75 80 



48 



96 



144 



192 



140 



AAA ATA AAA GAT ACC TAC ATA TTA GCA GTT GCA GAT GGT GTC GGA GGG 2 88 

Lys He Lys Asp Thr Tyr lie Leu Ala Val Ala Asp Gly Val Gly Gly 

85 90 95 

CAC AGC TCA GGA GAT GTT GCA TCA AAG ATG GCA GTG GAT ATT TTA GAA 336 
His Ser Ser Gly Asp Val Ala Ser Lys Met Ala Val Asp lie Leu Glu 

100 105 110 

AAC ATT ATC ATG GAA AAA TAC AAT GAA AAC CTA TCA ATT GAA GAG ATA 3 84 

Asn He He Met Glu Lys Tyr Asn Glu Asn Leu Ser He Glu Glu He 
115 120 125 

AAA GAA CTT TTA AAA GAT GCA TAC ATT ACG GCA CAC AAC AAA ATA AAA 432 
Lys Glu Leu Leu Lys Asp Ala Tyr He Thr Ala His Asn Lys He Lys 
130 135 140 

GAA AAC GCT ATT GGA GAT AAA GAG GGA ATG GGA ACA ACA CTA ACA ACT 4 80 

Glu Asn Ala He Gly Asp Lys Glu Gly Met Gly Thr Thr Leu Thr Thr 
145 150 155 160 

GCA ATA GTT AAA GGG GAT AAA TGC GTT ATA GCA AAC TGC GGG GAT AGT 52 8 

Ala He Val Lys Gly Asp Lys Cys Val He Ala Asn Cys Gly Asp Ser 

165 170 175 

AGG GCT TAT TTA ATT AGA GAT GGA GAA ATA GTT TTT AGA ACA AAA GAC 576 
Arg Ala Tyr Leu He Arg Asp Gly Glu He Val Phe Arg Thr Lys Asp 

180 185 190 

CAC TCT TTG GTT CAG GTT TTA GTA GAT GAA GGA CAT ATT TCA GAG GAG 624 
His Ser Leu Val Gin Val Leu Val Asp Glu Gly His He Ser Glu Glu 
195 200 205 

GAC GCA AGG CAT CAT CCA ATG AAA AAT ATC ATT ACC TCA GCA TTG GGA 672 
Asp Ala Arg His His Pro Met Lys Asn He He Thr Ser Ala Leu Gly 
210 215 220 

TTG GAT GAA TTT AAG GTA GAT GAT TAC GAA TGG GAT TTA ATT GAT GGT 720 
Leu Asp Glu Phe Lys Val Asp Asp Tyr Glu Trp Asp Leu He Asp Gly 
225 230 235 240 

GAT GTA TTA TTG ATG AGC TCC GAT GGG CTT CAT GAT TAT GTC AGT AAG 768 
Asp Val Leu Leu Met Ser Ser Asp Gly Leu His Asp Tyr Val Ser Lys 

245 250 255 

GAA GAT ATT TTA AAA ACT GTA AAA AAT AAT GAT CAC CCA AAA GAT ATT 816 
Glu Asp He Leu Lys Thr Val Lys Asn Asn Asp His Pro Lys Asp He 

260 265 270 

GTA GAT GAA TTA TTC AAT ACT GCA TTA AAA GAG ACA AGG GAC AAT GTG 864 
Val Asp Glu Leu Phe Asn Thr Ala Leu Lys Glu Thr Arg Asp Asn Val 
275 280 285 

AGT ATT ATT CGT ATA 879 
Ser lie lie Arg He 
290 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Met 


Glu 


He 


lie 


Asn 


Lys 


Phe 


Leu 


Lys 


Lys 


He 


Gly 


Tyr 


Lys 


Lys 


Asp 


1 








5 








10 










15 




Gly 

* 


Glu 


Glu 


Lys 


Lys 


Asp 


Lys 


Ser 


Lys 


Thr 


Lys 


lie 


Lys 


He 


Glu 


Glu 






20 










25 










30 






Glu 


Lys 


Thr 


Met 


Asp 


He 


Glu 


lie 


Pro 


Lys 


He 


Glu 


Pro 


Thr 


Glu 


Asn 




35 








40 










45 








Phe 


Asn 


Arg 


Asp 


Glu 


lie 


Val 


Phe 


Glu 


Glu 


Asp 


Asn 


Ala 


Tyr 


Gly 


He 




50 








55 










60 










Ser 


His 


Lys 


Gly 


Asn 


Arg 


Thr 


Asn 


Asn 


Glu 


Asp 


Asn 


lie 


Leu 


lie 


Arg 


65 






70 










75 










80 


Lys 


He 


Lys 


Asp 


Thr 


Tyr 


He 


Leu 


Ala 


Val 


Ala 


Asp 


Gly 


Val 


Gly 


Gly 




85 










90 










95 




His 


Ser 


Ser 


Gly 


Asp 


Val 


Ala 


Ser 


Lys 


Met 


Ala 


Val 


Asp 


He 


Leu 


Glu 








100 








105 










110 






Asn 


He 


He 


Met 


Glu 


Lys 


Tyr 


Asn 


Glu 


Asn 


Leu 


Ser 


He 


Glu 


Glu 


He 






115 








120 










125 








Lys 


Glu 


Leu 


Leu 


Lys 


Asp 


Ala 


Tyr 


lie 


Thr 


Ala 


His 


Asn 


Lys 


He 


Lys 


130 










135 










140 










Glu 


Asn 


Ala 


lie 


Gly 


Asp 


Lys 


Glu 


Gly 


Met 


Gly Thr 


Thr 


Leu 


Thr 


Thr 


145 








150 










155 










160 


Ala 


He 


val 


Lys 


Gly 


Asp 


Lys 


Cys 


Val 


lie 


Ala 


Asn 


Cys 


Gly 


Asp 


Ser 








165 










170 










175 




Arg 


Ala 


Tyr 


Leu 


He 


Arg 


Asp 


Gly Glu 


lie 


Val 


Phe 


Arg 


Thr 


Lys 


Asp 




180 










185 










190 






His 


Ser 


Leu 


Val 


Gin 


Val 


Leu 


Val 


Asp 


Glu 


Gly His 


lie 


Ser 


Glu 


Glu 






195 










200 










205 








Asp 


Ala 


Arg 


His 


His 


Pro 


Met 


Lys 


Asn 


lie 


He 


Thr 


Ser 


Ala 


Leu 


Gly 


210 








215 










220 










Leu 


Asp 


Glu 


Phe 


Lys 


Val 


Asp 


Asp 


Tyr 


Glu 


Trp 


Asp 


Leu 


lie 


Asp 


Gly 


225 






230 










235 










240 


Asp 


Val 


Leu 


Leu 


Met 


Ser 


Ser 


Asp 


Gly 


Leu 


His 


Asp 


Tyr 


Val 


Ser 


Lys 








245 










250 










255 




Glu 


Asp 


He 


Leu 


Lys 


Thr 


Val 


Lys 


Asn 


Asn 


Asp 


His 


Pro 


Lys 


Asp 


lie 






260 








265 










270 






Val 


Asp 


Glu 


Leu 


Phe 


Asn 


Thr 


Ala 


Leu 


Lys 


Glu 


Thr 


Arg 


Asp 


Asn 


Val 




275 










280 










285 








Ser 


lie 
290 


He 


Arg 


He 

























(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1 ... 951 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG ACT CTG CTA GCC CTG TAT CAG AAT AAA CGT GTT ATC GTC AAG CTT 4 8 

Met Thr Leu Leu Ala Leu Tyr Gin Asn Lys Arg Val He Val Lys Leu 
15 10 15 



GGC TGG GGG AGC GGC ACT AGC CAA ATA ACT AAC GAG GCG CAA GTG CTG 
Gly Trp Gly Ser Gly Thr Ser Gin lie Thr Asn Glu Ala Gin Val Leu 



96 



20 



25 



30 



AGC GTA TTG CAC GAT ATG CCT ATA GTG CCC AGA CTG CAT ACC CGT CTA 
Ser Val Leu His Asp Met Pro lie Val Pro Arg Leu His Thr Arg Leu 
35 40 45 

GAC TTA GAT GAT GTC AAG CTC GTT GCG ATA GAG TAG ATA CCC TAC AAG 
Asp Leu Asp Asp Val Lys Leu Val Ala lie Glu Tyr He Pro Tyr Lys 
50 55 60 

AGC CTT AAC GCC GTC GGC CGC TTG AAC CCC CTT AAG GCT GTC ACA GCC 
Ser Leu Asn Ala Val Gly Arg Leu Asn Pro Leu Lys Ala Val Thr Ala 
65 70 75 80 

GTC TTC TAT ACA CTC GCA TCG CTA GTC CAT ATC CAC GGC CGT GGT TTT 
Val Phe Tyr Thr Leu Ala Ser Leu Val His He His Gly Arg Gly Phe 

85 90 95 

GCT CAT TGC GAC CTA AAG CCG GGT AAC GTT ATA CCA GTT CCC AAG CGT 
Ala His Cys Asp Leu Lys Pro Gly Asn Val He Pro Val Pro Lys Arg 

100 105 HO 

GGC ATG GTG TTC ATC GAC TTT GGT GTT GCA CGA CCT TTT GAC GCT GCG 
Gly Met Val Phe He Asp Phe Gly Val Ala Arg Pro Phe Asp Ala Ala 
115 120 125 

GGC TTC GCG GCA GGA ACA CCA GGG TAT ACG TGC CCA GAG GCT CTC GGC 
Gly Phe Ala Ala Gly Thr Pro Gly Tyr Thr Cys Pro Glu Ala Leu Gly 
130 135 140 

GGC GAG ACC CCC GGC TCT GGC TGC GAT CTC TAC AGC CTT GCC GGC ATA 
Gly Glu Thr Pro Gly Ser Gly Cys Asp Leu Tyr Ser Leu Ala Gly He 
145 150 155 160 

TAC TAC TAC TTG GTT ACC GGG TTA AGC CCG CCA CGC GAC CCA AAA GAG 
Tyr Tyr Tyr Leu Val Thr Gly Leu Ser Pro Pro Arg Asp Pro Lys Glu 

165 170 175 

TTC GCC AAG GCG CTC TCG TTG GCT CCC GCT CCA AGT AGC CTC TTG GAA 
Phe Ala Lys Ala Leu Ser Leu Ala Pro Ala Pro Ser Ser Leu Leu Glu 

180 185 190 

CTG TTC ACA CAG CTG GTG CTG GAT CCC GAG TAT CGT AAC AGC CTT GAT 
Leu Phe Thr Gin Leu Val Leu Asp Pro Glu Tyr Arg Asn Ser Leu Asp 
195 200 205 

CCT CTC CAG CTG TTG AAG ATT GTT GCA TCT TTT AAC CCG CAA CTG CTA 
Pro Leu Gin Leu Leu Lys He Val Ala Ser Phe Asn Pro Gin Leu Leu 
210 215 220 

GTC CCT CAT ATC GTT ATA GAT GGT GTT TAC AAG CCG CTA GGT TAC GGC 
Val Pro His He Val He Asp Gly Val Tyr Lys Pro Leu Gly Tyr Gly 
225 230 235 240 

GAG GTA AGC ATA GGC TCT AGA GGC GTT ATA CGT GTT GAT GGA CGA CCA 
Glu Val Ser He Gly Ser Arg Gly Val He Arg Val Asp Gly Arg Pro 

245 250 255 

GTG TAC CTC GCG GTT AAG AGG CAT GTG AGG GGC ACA AGT ATG TAC GCG 
Val Tyr Leu Ala Val Lys Arg His Val Arg Gly Thr Ser Met Tyr Ala 

260 265 270 

TAT ACG GAT CTT GTC GTG TTT AGG AGA GGC GAG AAA CTC ATA GTG AGA 
Tyr Thr Asp Leu Val Val Phe Arg Arg Gly Glu Lys Leu He Val Arg 
275 280 285 



* 



AGC GGT GAG AGT ATA GAC CTA GAG TTT AAC GAC CTG GTG TTG TTC GAC 
Ser Gly Glu Ser He Asp Leu Glu Phe Asn Asp Leu Val Leu Phe Asp 
290 295 300 

AAC CAC ATA CTA TAC GTA TTT ATC CTT CCG GAA AGG CCC 
Asn His He Leu Tyr Val Phe He Leu Pro Glu Arg Pro 
305 310 315 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met 


Thr 


Leu 


Leu 


Ala 


Leu 


Tyr 


Gin 


Asn 


Lys 


Arg 


Val 


He 


Val 


Lys 


Leu 


1 








5 










10 










15 




Glv 


Trp 


Gly 


Ser 


Gly 


Thr 


Ser 


Gin 


lie 


Thr 


Asn 


Glu 


Ala 


Gin 


Val 


Leu 


20 










25 










30 






Ser 


Val 


Leu 


His 


Asp 


Met 


Pro 


He 


Val 


Pro 


Arg 


Leu 


His 


Thr 


Arg 


Leu 






35 








40 










45 








Asp 


Leu 


Asp 


Asp 


Val 


Lys 


Leu 


Val 


Ala 


He 


Glu 


Tyr 


He 


Pro 


Tyr 


Lys 


50 










55 










60 










Ser 


Leu 


Asn 


Ala 


Val 


Gly 


Arq 


Leu 


Asn 


Pro 


Leu 


Lys 


Ala 


Val 


Thr 


Ala 


65 

V — ' 










70 










75 










80 


Val 


Phe 


Tyr 


Thr 


Leu 


Ala 


Ser 


Leu 


Val 


His 


He 


His 


Gly 


Arg 


Gly 


Phe 








85 










90 










95 




Ala 


His 


Cys 


Asp 


Leu 


Lys 


Pro 


Gly Asn 


Val 


He 


Pro 


Val 


Pro 


Lys 


Arg 






100 










105 










110 






Gly 


Met 


Val 


Phe 


He 


Asp 


Phe 


Gly 


Val 


Ala 


Arg 


Pro 


Phe 


Asp 


Ala 


Ala 




115 










120 










125 








Gly 


Phe 


Ala 


Ala 


Gly 


Thr 


Pro 


Gly Tyr 


Thr 


Cys 


Pro 


Glu 


Ala 


Leu 


Gly 


130 










135 










140 










Gly 


Glu 


Thr 


Pro 


Gly 


Ser 


Gly 


Cys 


Asp 


Leu 


Tyr 


Ser 


Leu 


Ala 


Gly 


He 


145 










150 










155 










160 


Tyr 


Tyr 


Tyr 


Leu 


Val 


Thr 


Gly 


Leu 


Ser 


Pro 


Pro 


Arg 


Asp 


Pro 


Lys 


Glu 








165 










170 










175 




Phe 


Ala 


Lys 


Ala 


Leu 


Ser 


Leu 


Ala 


Pro 


Ala 


Pro 


Ser 


Ser 


Leu 


Leu 


Glu 






180 










185 










190 






Leu 


Phe 


Thr 
195 


Gin 


Leu 


Val 


Leu 


Asp 
200 


Pro 


Glu 


Tyr 


Arg 


Asn 
205 


Ser 


Leu 


Asp 


Pro 


Leu 


Gin 


Leu 


Leu 


Lys 


He 


Val 


Ala 


Ser 


Phe 


Asn 


Pro 


Gin 


Leu 


Leu 




210 








215 










220 










Val 


Pro 


His 


He 


Val 


He 


Asp 


Gly 


Val 


Tyr 


Lys 


Pro 


Leu 


Gly 


Tyr 


Gly 


225 










230 










235 










240 


Glu 


Val 


Ser 


He 


Gly 


Ser 


Arg 


Gly Val 


He 


Arg 


Val 


Asp 


Gly 


Arg 


Pro 










245 










250 










255 




Val 


Tyr 


Leu 


Ala 


Val 


Lys 


Arg 


His 


Val 


Arg Gly 


Thr 


Ser 


Met 


Tyr 


Ala 








260 










265 










270 






Tyr 


Thr 


Asp 


Leu 


Val 


Val 


Phe 


Arg 


Arg 


Gly Glu 


Lys 


Leu 


He 


Val 


Arg 




275 










280 










285 








Ser 


Gly 
290 


Glu 


Ser 


He 


Asp 


Leu 
295 


Glu 


Phe 


Asn 


Asp 


Leu 

300 


Val 


Leu 


Phe 


Asp 


Asn 


His 


He 


Leu 


Tyr 


Val 


Phe 


He 


Leu 


Pro 


Glu 


Arg 


Pro 








305 










310 










315 













(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 . . . 1320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATG GAC ATC AGG GCC GTT GTT TTT GAC CTC GAC GGG ACG CTT GTG GGT 4 8 

Met Asp He Arg Ala Val Val Phe Asp Leu Asp Gly Thr Leu Val Gly 
15 10 15 

GCT GAG AAG ACT TTC AGC GAG ATA AAG TCC GAG CTT AAA GAA CGG CTG 96 
Ala Glu Lys Thr Phe Ser Glu He Lys Ser Glu Leu Lys Glu Arg Leu 

20 25 30 

ATT TCC TTA GGG ATT CCC AGG GAG CTC GTT GGA GAG CTA ACG CCG ATG 144 
He Ser Leu Gly He Pro Arg Glu Leu Val Gly Glu Leu Thr Pro Met 
35 40 45 

TAT GAG GGC CTT ATC GAG CTG TCC AGA AAA ACG GGC AGA CCT TTC GAA 192 
Tyr Glu Gly Leu He Glu Leu Ser Arg Lys Thr Gly Arg Pro Phe Glu 
50 55 60 

GAG ATG TAC TCA ATT CTC GTC AAT CTT GAA GTT GAA AGG ATA AGG GAC 24 0 

Glu Met Tyr Ser He Leu Val Asn Leu Glu Val Glu Arg He Arg Asp 
65 70 75 80 

AGC TTT CTC TTC GAG GGG GCA AGG GAG CTC CTC GAC TTT CTT GTG GGG 288 
Ser Phe Leu Phe Glu Gly Ala Arg Glu Leu Leu Asp Phe Leu Val Gly 

85 90 95 

GAG GGA ATA AAG CTT GCC CTC ATG ACC CGG AGC TCC AGA ATG GCT GCC 336 
Glu Gly He Lys Leu Ala Leu Met Thr Arg Ser Ser Arg Met Ala Ala 

100 105 110 

CTT GAG GCC CTG GAG CTT CAC GGC ATT AAG GAC TAC TTT GAG ATT ATT 3 84 

Leu Glu Ala Leu Glu Leu His Gly He Lys Asp Tyr Phe Glu He He 
115 120 125 

TCA ACG AGG GAT GAT GTC CCT CCC GAG GAG CTG AAA CCG AAT CCT GGC 4 32 

Ser Thr Arg Asp Asp Val Pro Pro Glu Glu Leu Lys Pro Asn Pro Gly 
130 135 140 

CAG CTG AGG AGA ATC CTC GGT GAG CTC AAC GTT CAA CCA GAG AAA GCC 480 
Gin Leu Arg Arg He Leu Gly Glu Leu Asn Val Gin Pro Glu Lys Ala 
145 150 155 160 

ATC GTC GTT GGA GAC CAC GGC TAC GAT GTC ATC CCT GCC CGG GAG CTC 52 8 

He Val Val Gly Asp His Gly Tyr Asp Val He Pro Ala Arg Glu Leu 

165 170 175 

GGC GCT CTG AGC GTC CTT GTC ACC GGC CAC GAG GCT GGC AGA ATG AGC 57 6 

Gly Ala Leu Ser Val Leu Val Thr Gly His Glu Ala Gly Arg Met Ser 

180 185 190 



TTT CAG GTT GAA GCC GAG CCA AAC TTT GAG GTC GAG AAC CTC ATT CAC 
Phe Gin Val Glu Ala Glu Pro Asn Phe Glu Val Glu Asn Leu He His 
195 200 205 



624 



CTC AGG AAG 
Leu Arg Lys 
210 

GCT TAC AAC 
Ala Tyr Asn 
225 

AGG TAT TTC 
Arg Tyr Phe 



GAT AGA ACG 
Asp Arg Thr 



CAT CTC GTC 
His Leu Val 
275 

TAT GCC ATC 
Tyr Ala lie 
290 

GGC CAG CAC 
Gly Gin His 
305 

GAG GGC AGG 
Glu Gly Arg 



AGC CAG ATG 
Ser Gin Met 



GTG ACC GCG 
Val Thr Ala 
355 

TTA AGG TGT 
Leu Arg Cys 
370 

GAC CGC TAT 
Asp Arg Tyr 
385 

GGC TGT AGA 
Gly Cys Arg 



TTT ATG AAG 
Phe Met Lys 



CTC TTC GAG 
Leu Phe Glu 



GAG GAG AAG 
Glu Glu Lys 
230 

AAA AAG GAC 
Lys Lys Asp 
245 

GAG GAG ATA 
Glu Glu He 
260 

AAC AGG GGG 
Asn Arg Gly 



AGA AAA AAC 
Arg Lys Asn 



CTT ATA AGC 
Leu He Ser 
310 

GCG GAC TTT 
Ala Asp Phe 
325 

CCC CTC GTG 
Pro Leu Val 
340 

GTT TTT GCT 
Val Phe Ala 



CTA AGC GGC 
Leu Ser Gly 



GCC GTG TCG 
Ala Val Ser 
390 

ATT GTC GAA 
He Val Glu 
405 

AAG GGG ACG 
Lys Gly Thr 
420 



AGG CTC CTG 
Arg Leu Leu 
215 

ACC ATC AAG 
Thr He Lys 



GAG ATA ATC 
Glu He lie 



GCT CGT TCT 
Ala Arg Ser 
265 

CTT GGT GGG 
Leu Gly Gly 
280 

GCC AAA CTT 
Ala Lys Leu 
295 

GAC GCC CTC 
Asp Ala Leu 



GCG GTC GGC 
Ala Val Gly 



AAG AAG TTC 
Lys Lys Phe 
345 

GGT AAA TAC 
Gly Lys Tyr 
360 

GAC TGC CTG 
Asp Cys Leu 
375 

AGT GAG ATT 
Ser Glu He 



GTT CCT ATC 
Val Pro He 



AAC GTT TTA 
Asn Val Leu 
425 



TCG AGC TAC 
Ser Ser Tyr 
220 

GGG GTA ATA 
Gly Val He 
235 

GTC GTG AAC 
Val Val Asn 
250 

TAC GGA GTC 
Tyr Gly Val 



GCC CTC GGA 
Ala Leu Gly 



GTC CTC ACA 
Val Leu Thr 
300 

CGC GTC ATG 
Arg Val Met 
315 

TCA AGG CTC 
Ser Arg Leu 
330 

GGC AAC TTC 
Gly Asn Phe 



GTC AGC GAC 
Val Ser Asp 



AGG AAA ATC 
Arg Lys He 
380 

ATA ATA GAG 
He He Glu 
395 

AAG GCT GTT 
Lys Ala Val 
410 

GAG GGC GTT 
Glu Gly Val 



GTT GTT GTT 
Val Val Val 



GAG AAT CTT 
Glu Asn Leu 



GAC GGC TCC 
Asp Gly Ser 
255 

CAC GTT CTT 
His Val Leu 
270 

ACG GGC TTT 
Thr Gly Phe 
285 

TTT GAT GCC 
Phe Asp Ala 



AGG CCA GTT 
Arg Pro Val 



AAA GGT GAC 
Lys Gly Asp 
335 

GTT CTA GAT 
Val Leu Asp 
350 

AGT CAG AGC 
Ser Gin Ser 
365 

AGG ATA ACC 
Arg He Thr 



GCC TCC AAA 
Ala Ser Lys 



TAC ACT GAG 
Tyr Thr Glu 
415 

AAG ATA GCC 
Lys He Ala 
430 



CCC 672 
Pro 



CTC 72 0 

Leu 

240 

AGG 768 
Arg 



ACG 816 
Thr 



GCC 864 
Ala 



GAC 912 
Asp 



GCG 960 

Ala 

320 

ACG 1008 
Thr 



GCC 1056 
Ala 



GGG 1104 
Gly 



TGC 1152 
Cys 



GCG 1200 

Ala 

400 

TAC 124 8 
Tyr 



CTG 12 96 
Leu 



AAC CTT CTC TTT GAC AAA CTG AGG 
Asn Leu Leu Phe Asp Lys Leu Arg 
435 440 



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 440 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Ma f- 


Ab P 


T 1 a 
iJlC 


Aiy 


Al a 


Vjq 1 

Val 


Va 1 

v O X 


Phe 


Asp 


Leu 

J-J 1 *w ^J- 


Asp 


Gly 


Thr 


Leu 


Val 


Gly 


_L 






•J 










10 










15 




Mia 


ul U 


T 




Ph a 
rile 




m u 

VJ J- Li 


lie 


Lys 


Ser 


Glu 


Leu 


Lvs 


Glu 


Arg 


Leu 






20 










25 










30 






lie 


O £i >~ 
OCX 




(jiy 


11c 


r IU 


A-L y 


Glu 


Leu 


Val 


Glv 


Glu 

J- VA 


Leu 


Thr 


Pro 


Met 






j D 








40 










45 








Tyr 


Glu 


\j ly 


T oil 
Lie U 


Tip 

lie 


\J J. Li 


T . aii 

±J C Li 


Ser Arg 


J— j y >_) 


Thr 


Gly 


Arc 


Pro 


Phe 


Glu 


50 








55 










60 










Glu 


Met 


ryr 




Tip 

lie 


7 oil 


v a.1 


7\ i*^ 

Asn 


ijeu 


Gl 11 


Val 


Glu 


Ara 


He 


Arg 


Asp 


65 








70 










75 










80 


Ser 


Phe 


-bCU 




C~\ ii 


\j ±y 


Al Pi 


Arg 


Glu 


TiPii 




Asd 


Phe 


Leu 


Val 


Gly 


















90 










95 




Glu 


Gly 


Tip 


uy q 


T . Al 1 






Met 


Thr 


Ara 


Ser 


Ser 


Arg 


Met 


Ala 


Ala 






100 










105 










110 






Leu 


Glu 


a. j.a 

i. -L -3 




^jIU. 


T .01 1 


Ul Q 


Gly 
120 


lie 


T.VC! 
4J ¥ O 




Tvr 

iy j. 


Phe 
125 


Glu 


He 


He 


Ser 


Thr 


Arg 


ASp 


ASp 


v ell 


ir 1 (J 


Pro 


Glu 






xj y 0 


Pro 


Asn 


Pro Gly 




130 






1 3 5 

~L J ~J 










140 










Gin 


Leu 


Arg 


Arg 


T I a 

lie 


T 1 


\j± y 


Glu 


Leu 


A en 


V CI X 


m n 


pro 


Glu 


Liy s 


ai a 


145 






1 RO 










155 










IbU 


lie 


Val 


val 


(jiy 


rtb p 


11 1 0 


uiy 


Tyr 


Asp 


Vfl 1 


Tie 

_1_ -1_ w 


^ J_ v— ' 


Ala 


Aro 


Glu 


Leu 








1 D O 










170 










17b 




Gly Ala 


Leu 


t>e r 


\T a 1 

v a.i 


T . ai 1 


v a X 


Thr 


Gly 


11 X ZD 




Al a 


Glv 


Ara 


Met 


Ser 








i sn 
xov 










185 










190 






Phe 


Gin 


vai 

Ijj 


oiu 


TJl 3 

Aia 


Pii, 
LjIU 




Asn 
200 


Phe 




V^l 


{J J- <u 


205 




He 


His 


Leu 


Arg 
210 


ijy s 


T oil 


irllc 


m ii 


A T~rr 

215 


Leu 


Leu 


Ser 


Ser 


Tvr 

y 

220 


Val 


Val 


Val 


Pro 


Ala 


Tyr 




Ijl U 


ul U 






lie 


Lys 


Gly 


Val 


He 


Glu 


Asn 


Leu 


Leu 


225 








230 










235 










240 


Arg 


Tyr 


rue 


uy s 


T \ t a 

iSy S 




m n 

Vj J. u. 


lie 


He 


Val 


Val 




Asn 

£\. (J ha/ 


Gly 


Ser 


Arg 




i _j 










250 










255 




Asp 


Arg 


T'Vi v 

I U.L 


Pin 
LrlU 


Pin 
ul U 


lie 


Ala 


Arg 


Ser 


Tyr 


Gly 


Val 


His 


Val 


Leu 


Thr 




9 £ A 
Z O U 










265 










270 






His 


Leu 


vai 


Asn 


Arg 


uiy 


T ^ 1 1 
LicU 


Gly Gly 


Ala 


Leu 


rtl V 
vj x y 


Th t" 
1 ii j- 


Glv 


Phe 


Ala 






275 










280 










285 








Tyr 


Ala 


Tic* 

lie 




iiy t> 




Al ^ 


Lys 


Leu 


Val 


Leu 


Thr 


Phe 


Asp 


Ala 


Asp 


290 










295 










300 










Gly 


Gin 


HIS 


Leu 


rid 

lie 


C Q V 


Asp 


Ala 


Leu 


Arg 


Val 


Ma(- 


A TTT 
ni y 


Pro 


Val 


Ala 


305 










310 










315 










320 


Glu 


Gly 


Arg 


Ala 


Asp 


Phe 


Ala 


Val 


Gly 


Ser 


Arg 


Leu 


Lys 


Gly 


Asp 


Thr 






325 










330 










335 




Ser 


Gin 


Met 


Pro 


Leu 


Val 


Lys 


Lys 


Phe 


Gly Asn 


Phe 


Val 


Leu 


Asp 


Ala 








340 










345 










350 






Val 


Thr 


Ala 
355 


Val 


Phe 


Ala 


Gly 


Lys 
360 


Tyr 


Val 


Ser 


Asp 


Ser 
365 


Gin 


Ser 


Gly 


Leu 


Arg 
370 


Cys 


Leu 


Ser 


Gly 


Asp 
375 


Cys 


Leu 


Arg 


Lys 


He 
380 


Arg 


He 


Thr 


Cys 


Asp 


Arg 


Tyr 


Ala 


Val 


Ser 


Ser 


Glu 


He 


He 


He 


Glu 


Ala 


Ser 


Lys 


Ala 


385 






390 










395 










400 


Gly 


Cys 


Arg 


lie 


Val 


Glu 


Val 


Pro 


He 


Lys 


Ala 


Val 


Tyr 


Thr 


Glu 


Tyr 




405 










410 










415 




Phe 


Met 


Lys 


Lys 


Gly 


Thr 


Asn 


Val 


Leu 


Glu 


Gly 


Val 


Lys 


He 


Ala 


Leu 






420 










425 










430 






Asn 


Leu 


Leu 
435 


Phe 


Asp 


Lys 


Leu 


Arg 
440 



















(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1. . .795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

ATG GAA AAT CTT GAA AAA CTC CTT GAA GTG GCA AAG ATG GCA GCC CTT 4 8 

Met Glu Asn Leu Glu Lys Leu Leu Glu Val Ala Lys Met Ala Ala Leu 
15 10 15 

GCC GGA GGA CAG GTA TTA AAG GAA AAC TTC GGA AAG ATT AAG CTT GAA 96 
Ala Gly Gly Gin Val Leu Lys Glu Asn Phe Gly Lys lie Lys Leu Glu 

20 25 30 

AAC ATT GAA GAA AAG GGA GAG AAG GAC TTC GTG AGC TAC GTT GAT AAA 14 4 

Asn He Glu Glu Lys Gly Glu Lys Asp Phe Val Ser Tyr Val Asp Lys 
35 40 45 

ACC TCC GAA GAG AGA ATA AAA GAG CTA ATA CTT AAG TTC TTT CCC GAC 192 
Thr Ser Glu Glu Arg He Lys Glu Leu He Leu Lys Phe Phe Pro Asp 
50 55 60 

CAC GAG GTC GTG GGG GAG GAA AGG GGA AAG GAG GGA AAA GAA AGC CCT 24 0 

His Glu Val Val Gly Glu Glu Arg Gly Lys Glu Gly Lys Glu Ser Pro 
65 70 75 80 

TAC AAA TGG TTC ATA GAC CCC CTT GAT GGG ACC AAG AAC TAC ATA AAG 2 88 

Tyr Lys Trp Phe He Asp Pro Leu Asp Gly Thr Lys Asn Tyr He Lys 

85 90 95 

GGC TTT CCC ATA TTT GCA GTC TCC GTC GGA CTC GTT AAG GAA AAC GAA 33 6 

Gly Phe Pro He Phe Ala Val Ser Val Gly Leu Val Lys Glu Asn Glu 

100 105 110 

CCT ATA GTG GGA GCG GTT TAC CTT CCT TAC TTT GAT ACC CTA TAC TGG 3 84 

Pro He Val Gly Ala Val Tyr Leu Pro Tyr Phe Asp Thr Leu Tyr Trp 
115 120 125 

GCT TCA AAG GGA AGG GGA GCC TAT AAA AAC GGG GAG AGG ATA AGC GTA 4 32 

Ala Ser Lys Gly Arg Gly Ala Tyr Lys Asn Gly Glu Arg lie Ser Val 
130 135 140 

AAG GAA AGG GGG GAG CTC AAG CAC GCG GCG GTT GTT TAC GGA TTT CCA 4 80 

Lys Glu Arg Gly Glu Leu Lys His Ala Ala Val Val Tyr Gly Phe Pro 
145 150 155 160 

TCA AGA AGC AGG AGG GAT ATA TCT CTT TAC CTG AAT GTG TTT AAA GAG 528 
Ser Arg Ser Arg Arg Asp He Ser Leu Tyr Leu Asn Val Phe Lys Glu 

165 170 175 



GTC TTT TAC GAA GTA GGT TCC GTT AGG AGG CCC GGG GCC GCA GCG GTT 576 
Val Phe Tyr Glu Val Gly Ser Val Arg Arg Pro Gly Ala Ala Ala Val 

180 185 190 



GAT ATA TGC 
Asp He Cys 
195 

GAG ATG AAG 
Glu Met Lys 
210 

GCT GGA GGA 
Ala Gly Gly 
225 

ATA ATA GCG 
He He Ala 



AAT AAA TAC 
Asn Lys Tyr 



ATG CTT GCG 
Met Leu Ala 



CCA TGG GAC 
Pro Trp Asp 



TTT TAC ACA 
Phe Tyr Thr 
230 

GGA AAC AGG 
Gly Asn Arg 
245 

ATG AAT AAT 
Met Asn Asn 
260 



GAG GGC ATA 
Glu Gly He 
200 

ATA ACG GCG 
He Thr Ala 
215 

CTG AAG GGA 
Leu Lys Gly 



ATG CTC CAC 
Met Leu His 



GAA AGC ACG 
Glu Ser Thr 
265 



TTT GAC GGG 
Phe Asp Gly 



GGA CTC GTA 
Gly Leu Val 
220 

GAC CCC TTC 
Asp Pro Phe 
235 

GAC TTC ATT 
Asp Phe He 
250 



ATG ATG GAG 
Met Met Glu 
205 

ATA CTG AAG 
lie Leu Lys 



GGC ATC TCG 
Gly lie Ser 



CTC AAG GTT 
Leu Lys Val 
255 



TTT 624 
Phe 



GAA 672 
Glu 



GAC 72 0 

Asp 

240 

GTG 768 
Val 



795 



(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Met 


Glu 


Asn 


Leu 


1 








Ala 


Gly 


Gly 


Gin 








20 


Asn 


He 


Glu 


Glu 






35 




Thr 


Ser 


Glu 


Glu 




50 






His 


Glu 


Val 


Val 


65 








Tyr 


Lys 


Trp 


Phe 


Gly 


Phe 


Pro 


He 








100 


Pro 


He 


Val 


Gly 






115 




Ala 


Ser 


Lys 


Gly 




130 






Lys 


Glu 


Arg 


Gly 


145 








Ser 


Arg 


Ser 


Arg 


Val 


Phe 


Tyr 


Glu 








180 


Asp 


He 


Cys 


Met 






195 




Glu 


Met 


Lys 


Pro 




210 






Ala 


Gly 


Gly 


Phe 


225 








He 


He 


Ala 


Gly 



Glu 


Lys 


Leu 


Leu 


5 








Val 


Leu 


Lys 


Glu 


Lys 


Gly 


Glu 


Lys 








40 


Arg 


He 


Lys 


Glu 






55 




Gly 


Glu 


Glu 


Arg 




70 






He 


Asp 


Pro 


Leu 


85 








Phe 


Ala 


Val 


Ser 


Ala 


Val 


Tyr 


Leu 








120 


Arg 


Gly 


Ala 


Tyr 






135 




Glu 


Leu 


Lys 


His 




150 






Arg 


Asp 


He 


Ser 


165 








Val 


Gly 


Ser 


Val 


Leu 


Ala 


Glu 


Gly 








200 


Trp 


Asp 


He 


Thr 






215 




Tyr 


Thr 


Leu 


Lys 




230 






Asn 


Arg 


Met 


Leu 



245 



Glu 


Val 


Ala 


Lys 




10 






Asn 


Phe 


Gly 


Lys 


25 








Asp 


Phe 


Val 


Ser 


Leu 


He 


Leu 


Lys 








60 


Gly 


Lys 


Glu 


Gly 






75 




Asp 


Gly 


Thr 


Lys 




90 






Val 


Gly 


Leu 


Val 


105 








Pro 


Tyr 


Phe 


Asp 


Lys 


Asn 


Gly 


Glu 








140 


Ala 


Ala 


Val 


Val 






155 




Leu 


Tyr 


Leu 


Asn 




170 






Arg 


Arg 


Pro 


Gly 


185 








He 


Phe 


Asp 


Gly 


Ala 


Gly 


Leu 


Val 








220 


Gly 


Asp 


Pro 


Phe 






235 




His 


Asp 


Phe 


He 




250 







Met Ala Ala Leu 
15 

He Lys Leu Glu 
30 

Tyr Val Asp Lys 
45 

Phe Phe Pro Asp 

Lys Glu Ser Pro 

80 

Asn Tyr lie Lys 
95 

Lys Glu Asn Glu 
110 

Thr Leu Tyr Trp 
125 

Arg He Ser Val 

Tyr Gly Phe Pro 

160 

Val Phe Lys Glu 
175 

Ala Ala Ala Val 
190 

Met Met Glu Phe 
205 

He Leu Lys Glu 

Gly lie Ser Asp 

240 

Leu Lys Val Val 
255 



Asn Lys Tyr Met Asn Asn Glu Ser Thr 

260 265 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1. . .591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

ATG AGT GAA CAG CCG GTA TTG TCT GTT CAA GGA TTA AGC GGC GGG TAT 4 8 

Met Ser Glu Gin Pro Val Leu Ser Val Gin Gly Leu Ser Gly Gly Tyr 
15 10 15 

AGC ATG AAC CGA CCG GTT CTG CAT GAC GTA ACC TTT CAG GTT GAA CCG 96 
Ser Met Asn Arg Pro Val Leu His Asp Val Thr Phe Gin Val Glu Pro 

20 25 30 

GGT GAG ATG GTG GGT TTG ATC GGC CTG AAC GGT GCG GGC AAG AGT ACC 144 
Gly Glu Met Val Gly Leu lie Gly Leu Asn Gly Ala Gly Lys Ser Thr 
35 40 45 

ACG ATG AAG CAT ATT CTC GGG CTG ATG AAT CCG CAA AAA GGG AGC ATT 192 
Thr Met Lys His lie Leu Gly Leu Met Asn Pro Gin Lys Gly Ser lie 
50 55 60 

CAG GTT CAA GGA AAG AGC CGG ACA GAG CAT TCG GAA GCC TAT CAC GGC 24 0 

Gin Val Gin Gly Lys Ser Arg Thr Glu His Ser Glu Ala Tyr His Gly 
65 70 75 80 

GCC TTG GCG TTT GTT CCC GAA TCC CCG CTG CTG TAT GAG GAG ATG ACA 288 
Ala Leu Ala Phe Val Pro Glu Ser Pro Leu Leu Tyr Glu Glu Met Thr 

85 90 95 

GTA CGA GAG CAT CTG GAA TTT ACG GCG CGC TCC TAT GGC GTA TCC CGT 33 6 

Val Arg Glu His Leu Glu Phe Thr Ala Arg Ser Tyr Gly Val Ser Arg 

100 105 110 

GAA GAT TAT GAG GCA CGT TCG GAG CAG CTG TCG AAG ATG TTC CGT ATG 3 84 

Glu Asp Tyr Glu Ala Arg Ser Glu Gin Leu Ser Lys Met Phe Arg Met 
115 120 125 

GAA GAG AAG ATG GAC AGC CTG TCC ACG CAT TTG TCC AAA GGG ATG CGC 432 
Glu Glu Lys Met Asp Ser Leu Ser Thr His Leu Ser Lys Gly Met Arg 
130 135 140 

CAA AAA GTG ATG ATC ATG TGC GCA TTC GTA GCC AGA CCG TCC CTG TAC 480 
Gin Lys Val Met He Met Cys Ala Phe Val Ala Arg Pro Ser Leu Tyr 
145 150 155 160 

ATC ATT GAC GAG CCC TTT CTT GGG CTT GAT CCG CTT GGG ATA CGC TCG 52 8 

He He Asp Glu Pro Phe Leu Gly Leu Asp Pro Leu Gly He Arg Ser 

165 170 175 



CTG CTT GAC TTC ATG CTG GAG CTG AAG GCA TCC GGC GCT TCG GTA TTG 
Leu Leu Asp Phe Met Leu Glu Leu Lys Ala Ser Gly Ala Ser Val Leu 



576 



180 



185 



190 



CTA AGC TCC CAC ATT 
Leu Ser Ser His He 
195 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 50: 



Met 


Ser 


Glu 


Gin 


Pro 


Val 


Leu 


Ser 


Val 


Gin 


Gly 


Leu 


Ser 


Gly 


Gly 


Tyr 


1 








5 










10 










15 




Ser 


Met 


Asn 


Arg 
20 


Pro 


Val 


Leu 


His 


Asp 
25 


Val 


Thr 


Phe 


Gin 


Val 
30 


Glu 


Pro 


Gly Glu 


Met 


Val 


Gly 


Leu 


He 


Gly 


Leu 


Asn 


Gly Ala 


Gly 


Lys 


Ser 


Thr 






35 










40 










45 








Thr 


Met 

50 


Lys 


His 


He 


Leu 


Gly 
55 


Leu 


Met 


Asn 


Pro 


Gin 
60 


Lys 


Gly 


Ser 


He 


Gin 


Val 


Gin 


Gly 


Lys 


Ser 


Arg 


Thr 


Glu 


His 


Ser 


Glu 


Ala 


Tyr 


His 


Gly 


65 










70 










75 










80 


Ala 


Leu 


Ala 


Phe 


Val 
85 


Pro 


Glu 


Ser 


Pro 


Leu 
90 


Leu 


Tyr 


Glu 


Glu 


Met 
95 


Thr 


val 


Arg 


Glu 


His 
100 


Leu 


Glu 


Phe 


Thr 


Ala 
105 


Arg 


Ser 


Tyr 


Gly 


Val 
110 


Ser 


Arg 


Glu 


Asp 


Tyr 
115 


Glu 


Ala 


Arg 


Ser 


Glu 
120 


Gin 


Leu 


Ser 


Lys 


Met 

125 


Phe 


Arg 


Met 


Glu 


Glu 
130 


Lys 


Met 


Asp 


Ser 


Leu 
135 


Ser 


Thr 


His 


Leu 


Ser 
140 


Lys 


Gly 


Met 


Arg 


Gin 


Lys 


Val 


Met 


He 


Met 


Cys 


Ala 


Phe 


Val 


Ala 


Arg 


Pro 


Ser 


Leu 


Tyr 


145 










150 










155 










160 


He 


He 


Asp 


Glu 


Pro 
165 


Phe 


Leu 


Gly 


Leu 


Asp 
170 


Pro 


Leu 


Gly 


He 


Arg 

175 


Ser 


Leu 


Leu 


Asp 


Phe 


Met 


Leu 


Glu 


Leu 


Lys 


Ala 


Ser 


Gly Ala 


Ser 


Val 


Leu 








180 










185 










190 






Leu 


Ser 


Ser 
195 


His 


He 

























(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1 ... 1482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-51: 

ATG AAG AAA ATA ACT ATT AGT AGT TTG CTT CTA CTT TTA CTT ATT TCT 
Met Lys Lys He Thr He Ser Ser Leu Leu Leu Leu Leu Leu He Ser 
15 10 15 



ACC AAT TTG 
Thr Asn Leu 



ATA ATA ATC 
He He He 
35 

ACA AAG CTT 
Thr Lys Leu 
50 

GGA TTC GAA 
Gly Phe Glu 
65 

GCA GCA GGA 
Ala Ala Gly 



ATT TCA GTT 
He Ser Val 



CTT GAA ATA 
Leu Glu He 
115 

ACT AGA ATT 
Thr Arg lie 
130 

GAC AGA GAT 
Asp Arg Asp 
145 

GTC AAC GTC 
Val Asn Val 



CTA AAA ATG 
Leu Lys Met 

GAG CTC GAG 
Glu Leu Glu 
195 

AGC CAC ATT 
Ser His He 
210 

TTG GAA ATG 
Leu Glu Met 
225 

GGG TTC TTT 
Gly Phe Phe 



GAG AAT GAT 
Glu Asn Asp 



AAT CTC GCA 
Asn Leu Ala 
20 

CTC ATT GGA 
Leu He Gly 



GTT TAT GGT 
Val Tyr Gly 



CTT ACT GAG 
Leu Thr Glu 
70 

ACT GCA ATA 
Thr Ala He 
85 

ACT AAC ATA 
Thr Asn He 
100 

GCC CAG GTA 
Ala Gin Val 



ACA CAC GCA 
Thr His Ala 



ATG GAA GAG 
Met Glu Glu 
150 

CTA TTA GGT 
Leu Leu Gly 
165 

GCA AAA GAA 
Ala Lys Glu 
180 

AAA GCA GAG 
Lys Ala Glu 



CCT TAC GTA 
Pro Tyr Val 



ACT AAA AAA 
Thr Lys Lys 
230 

CTC ATG ATT 
Leu Met He 
245 

ATA GCA TCA 
He Ala Ser 
260 



TAC GAT TCC 
Tyr Asp Ser 
25 

GAC GGC ATG 
Asp Gly Met 
40 

CAT CTA AAC 
His Leu Asn 
55 

TCA TTA AGT 
Ser Leu Ser 



GCA ACT GGA 
Ala Thr Gly 



ACT GGA AAA 
Thr Gly Lys 
105 

CTT GGA AAA 
Leu Gly Lys 
120 

ACC CCT GCA 
Thr Pro Ala 
135 

GAA ATA GCG 
Glu He Ala 



GGA GGG AGA 
Gly Gly Arg 



CAG GGA TAT 
Gin Gly Tyr 
185 

GGT GAG TTT 
Gly Glu Phe 
200 

TTG GAC AGA 
Leu Asp Arg 
215 

GCA ATT TCA 
Ala He Ser 



GAA GGG GGC 
Glu Gly Gly 



GTT GTT GCA 
Val Val Ala 
265 



CAA GAG AGC 
Gin Glu Ser 



GGA ATG AGT 
Gly Met Ser 



ATG GAA GAG 
Met Glu Glu 
60 

GGG GAA GTT 
Gly Glu Val 
75 

GTC AAA ACA 
Val Lys Thr 
90 

GTT ACA AAT 
Val Thr Asn 



TCA ACT GGA 
Ser Thr Gly 



GTA TTT GCT 
Val Phe Ala 
140 

AGA CAG CTC 
Arg Gin Leu 
155 

AAG AAA TTT 
Lys Lys Phe 
170 

AAT ATA GTC 
Asn lie Val 



ATT CTA GGG 
He Leu Gly 



AAA CCA GAA 
Lys Pro Glu 
220 

ATA CTA GAG 
lie Leu Glu 
235 

AGA ATT GAT 
Arg lie Asp 
250 

GAG ACT AAG 
Glu Thr Lys 



GGT ATT AAA 
Gly He Lys 
30 

CAT GTC CAG 
His Val Gin 
45 

TTC CCA ATT 
Phe Pro lie 



ACG GAC TCC 
Thr Asp Ser 



TAT AAT CGA 
Tyr Asn Arg 
95 

CTA ACT ACC 
Leu Thr Thr 
110 

CTT GTG ACT 
Leu Val Thr 
125 

TCC CAC GTT 
Ser His Val 



ATA GCT CAC 
lie Ala His 



GAC GAG AAT 
Asp Glu Asn 
175 

TTC ACG AAA 
Phe Thr Lys 
190 

CTT TTT GCA 
Leu Phe Ala 
205 

GAT GTT GGA 
Asp Val Gly 



AAA AAT CCA 
Lys Asn Pro 



CAT GCA GCT 
His Ala Ala 
255 

GAG TTT GAT 
Glu Phe Asp 
270 



AAT 96 
Asn 



ATT 144 
He 



ATT 192 
He 



GCT 240 
Ala 
80 

ATG 288 
Met 



TTG 336 
Leu 



ACT 3 84 

Thr 



CCT 432 
Pro 



CGG 480 

Arg 

160 

ACC 528 
Thr 



GAA 576 
Glu 



GAT 624 
Asp 



CTT 672 
Leu 



AAT 720 

Asn 

240 

CAT 768 
His 



GAC 816 
Asp 



GTT GTT GGA TAT GTT CTT GAG TAT GCA AAA AAG AGG GGA GAT AC A CTA 864 
Val Val Gly Tyr Val Leu Glu Tyr Ala Lys Lys Arg Gly Asp Thr Leu 
275 280 285 

GTA ATA GTG CTG GCT GAC CAT GAG AC A GGG GGG CTT GGA TTA GGT CTA 912 
Val He Val Leu Ala Asp His Glu Thr Gly Gly Leu Gly Leu Gly Leu 
290 295 300 

ACA TAT GGA GAT GCA ATT AAT GAA GAT GTC ATC AGG AAC ATA AAC GCT 960 
Thr Tyr Gly Asp Ala lie Asn Glu Asp Val He Arg Asn He Asn Ala 
305 310 315 320 

AGT GTG TCG AAA ATT GCT AGT GAA ATA AGG GCA ACG AAT GAC ATA AAG 100 8 
Ser Val Ser Lys He Ala Ser Glu He Arg Ala Thr Asn Asp He Lys 

325 330 335 

AGA GTT ATC AAA AAA TAT ACT GGA TTC GAG CTA ACA GAG GAC GAA ATT 1056 
Arg Val He Lys Lys Tyr Thr Gly Phe Glu Leu Thr Glu Asp Glu lie 

340 345 350 

AAT TAC ATT GAG GAA GCT ATA AAC TTA GCA GAC GAA TAT GCG CTT CAA 1104 
Asn Tyr He Glu Glu Ala He Asn Leu Ala Asp Glu Tyr Ala Leu Gin 
355 360 365 

AAT GCA ATA GCT GAT ATT ATA AAC AAA CGC GTT GGT GTA GGT TTT GTA 115 2 
Asn Ala lie Ala Asp lie He Asn Lys Arg Val Gly Val Gly Phe Val 
370 375 380 

TCC CAC AAA CAT ACA GGA GCT CCT GTT TCA CTT CTA GCC TAC GGC CCA 12 0 0 
Ser His Lys His Thr Gly Ala Pro Val Ser Leu Leu Ala Tyr Gly Pro 
385 390 395 400 

GGT GCA GAG AAT TTT GCA GGC TTT TTA CAC CAT GTA GAT ACG GCA AAG 124 8 
Gly Ala Glu Asn Phe Ala Gly Phe Leu His His Val Asp Thr Ala Lys 

405 410 415 

CTA ATT GCC AAG CTA ATG CTC TTT GGG AAG AAA GAT ATT CCC GTT ACC 12 96 
Leu He Ala Lys Leu Met Leu Phe Gly Lys Lys Asp lie Pro Val Thr 

420 425 430 

ATC TTG GGA ATA AGT GGA GTT AAA GGA GAT ATA ACC GGA GAC TTC AAA 134 4 
He Leu Gly He Ser Gly Val Lys Gly Asp lie Thr Gly Asp Phe Lys 
435 440 445 

GTG GAT GAG CAA GAT GCA TAT GTG ACC TTA ATG ATG TTG CTT GGG GAA 13 92 
Val Asp Glu Gin Asp Ala Tyr Val Thr Leu Met Met Leu Leu Gly Glu 
450 455 460 

AGG GTA GAT ACT GAA CTT GAA AGG AAA GTC GAC ATG AAT AAT AAC GGC 144 0 
Arg Val Asp Thr Glu Leu Glu Arg Lys Val Asp Met Asn Asn Asn Gly 
465 470 475 480 

ATA ATC GAG TTG GGA GAC GTG CTC CTG ATT CTA CAA GAG TCC 14 82 

He lie Glu Leu Gly Asp Val Leu Leu lie Leu Gin Glu Ser 

485 490 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 



Met" 


Lvs 


Lvs 


He 

-i. -i- •■j - 


Thr 


He 


Ser 


Ser 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


lie 


Ser 


1 

-*- 




5 










10 










15 






Aqn 

*i u ii 


Leu 


Asn 


Leu 


Ala 


Tvr 


Asp 


Ser 


Gin 


Glu 


Ser 


Gly 


He 


Lys 


Asn 








20 










25 










30 






Tie 


lie 

X _k_ <>w 


He 


Leu 

■J-J v^- u. 


He 


Glv 


Asd 


Gly Met 


Gly Met 


Ser 


His 


Val 


Gin 


lie 






35 










40 










45 








Thr 


Lvs 


Leu 

*v 


Val 


Tvr 


Gly 


His 


Leu 


Asn 


Met 


Glu 


Glu 


Phe 


Pro 


lie 


lie 




50 










55 










60 










Glv 


Phe 

■i i 


Glu 


Leu 


Thr 

J- 1 1 J— 


Glu 


Ser 


Leu 


Ser 


Glv 


Glu 


Val 


Thr 


Asp 


Ser 


Ala 


65 










70 










75 










80 


Ala 


Ala 


Glv 


Thr 


Ala 


He 


Ala 


Thr 


Gly 


Val 


Lvs 


Thr 


Tyr 


Asn 


Arg 


Met 








85 










90 










95 




lie 


Ser 


Val 


Thr 

i ii j— 


Asn 


He 


Thr 


Gly 


Lys 


Val 


Thr 


Asn 


Leu 


Thr 


Thr 


Leu 








100 










105 










110 






Leu 

1_J V#L 


Glu 


He 


Ala 


Gin 


Val 


Leu 


Gly 


Lys 


Ser 


Thr 


Gly 


Leu 


Val 


Thr 


Thr 






115 










120 










125 








Thr 


Ara 


He 


Thr 


His 


Ala 


Thr 


Pro 


Ala 


Val 


Phe 


Ala 


Ser 


His 


Val 


Pro 




130 










135 










140 










Aso 


Aro 


Asp 


Met 


Glu 


Glu 


Glu 


He 


Ala 


Arq 


Gin 


Leu 


lie 


Ala 


His 


Arg 


145 










150 










155 










160 


Val 


As n 

ij. 


Val 


Leu 


Leu 


Gly 


Glv 


Gly Arg 


Lys 


Lys 


Phe 


Asp 

It 


Glu 


Asn 


Thr 










165 










170 










175 




Le»u 


TjVS 

1_J Y u 


Met 


Ala 


Lvs 


Glu 


Gin 


Gly 


Tyr 


Asn 


lie 


Val 


Phe 


Thr 


Lys 


Glu 






180 










185 










190 






Gl u 


Leu 


Glu 


Lvs 


Ala 


Glu 


Glv 


Glu 


Phe 


He 


Leu 


Gly 


Leu 


Phe 


Ala 


Asp 

i^ 






195 








200 










205 








k_J J_ 


His 

ill. w 


He 


Pro 


Tvr 


Val 


Leu 


Asp 


Arg 


Lys 


Pro 


Glu 


Asp 


Val 


Gly 


Leu 




210 










215 










220 










l-Jw LI 


Glu 


Met 

1 i^ w 


Thr 


Lys 


Lys 


Ala 


lie 


Ser 


lie 


Leu 


Glu 


Lys 


Asn 


Pro 


Asn 


225 

u fc* J 








230 










235 










240 


Gl V 




Phe 


T.pn 

J-J «wi 


Met 


He 

_i_ _i_ <o 


Glu 


Gly Gly 


Arg 


lie 


Asp 


His 


Ala 


Ala 


His 








245 










250 










255 




Glu 


Asn 




He 


Ala 


Ser 


Val 


Val 


Ala 


Glu 


Thr 


Lvs 


Glu 


Phe 


Asp 


Asp 








2 60 










265 










270 






Val 

v G J. 


Val 


Glv 


Tvr 


Val 


Leu 


Glu 


Tyr 


Ala 


Lys 


Lys 


Arg 


Gly 


Asp 


Thr 


Leu 






275 










280 










285 








Val 


He 

.J- -1 


Val 


Leu 


Ala 


Asp 


His 


Glu 


Thr 


Gly Gly 


Leu 


Gly 


Leu 


Gly 


Leu 




290 










295 










300 










Thr 


Tvr 


Glv 


Asd 


Ala 


He 


Asn 


Glu 


Asp 


Val 


He 


Arg 


Asn 


lie 


Asn 


Ala 


305 










310 










315 










320 


Ser 


Val 


Ser 


Lys 


He 


Ala 


Ser 


Glu 


He 


Arg 


Ala 


Thr 


Asn 


Asp 


lie 


Lys 

-4 








325 










330 










335 




A TO 


Val 

v a. _l 


He 

X _L *w 


Lvs 


Lvs 


Tvr 


Thr 


Gly 


Phe 


Glu 


Leu 


Thr 


Glu 


Asp 


Glu 


lie 








340 










345 










350 








Tvr 
j 


He 

•1. J_ V— ' 


Glu 


Glu 


Ala 


lie 


Asn 


Leu 


Ala 


Asp 


Glu 


Tyr 


Ala 


Leu 


Gin 




355 










360 










365 








Aqn 

O 1 1 


Ala 


He 


Ala 


Asd 


lie 


lie 


Asn 


Lys 


Arg 


Val 


Gly 


Val 


Gly 


Phe 


Val 




3 70 










375 










380 












His 

1 Jr J- 


Lvs 


His 

1 1 -L, *J 


Thr 


Glv 


Ala 


Pro 


Val 


Ser 


Leu 


Leu 


Ala 


Tyr 


Gly 


Pro 


385 








390 










395 










400 


Gly 


Ala 


Glu 


Asn 


Phe 


Ala 


Gly 


Phe 


Leu 


His 


His 


Val 


Asp 


Thr 


Ala 


Lys 








405 










410 










415 




Leu 


He 


Ala 


Lys 


Leu 


Met 


Leu 


Phe 


Gly 


Lys 


Lys 


Asp 


He 


Pro 


Val 


Thr 








420 










425 










430 






He 


Leu 


Gly 


lie 


Ser 


Gly 


Val 


Lys 


Gly 


Asp 


He 


Thr 


Gly 


Asp 


Phe 


Lys 






435 










440 










445 








Val 


Asp 


GlU 


Gin 


Asp 


Ala 


Tyr 


Val 


Thr 


Leu 


Met 


Met 


Leu 


Leu 


Gly 


Glu 




450 










455 










460 










Arg 


Val 


Asp 


Thr 


GlU 


Leu 


Glu 


Arg 


Lys 


Val 


Asp 


Met 


Asn 


Asn 


Asn 


Gly 


465 










470 










475 










480 


lie 


He 


GlU 


Leu 


Gly 


Asp 


Val 


Leu 


Leu 


He 


Leu 


Gin 


Glu 


Ser 







485 



490 



(2) INFORMATION FOR SEQ ID NO: 53 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 



Coding Sequence 
1. . .954 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATG ATT AAC CAA ATA AAC TTC AAA ACC TCT CAT GGA GGA AGC AGA GAA 

Met He Asn Gin lie Asn Phe Lys Thr Ser His Gly Gly Ser Arg Glu 

15 10 15 

GAA GGC TAC ATA AAC TTC TCG GCC TCT GTA AAT CCT TAT CCA CCA GAA 

Glu Gly Tyr He Asn Phe Ser Ala Ser Val Asn Pro Tyr Pro Pro Glu 

20 25 30 



TGG ACT GAT 
Trp Thr Asp 
35 

CCT TAC TAT 
Pro Tyr Tyr 
50 

CCA ATA ACT 
Pro He Thr 
65 

GTT TGG ATG 
Val Trp Met 



GAA ATG TTT GAG AGG GCT AAA AAG ATA AGC ACC TTC TAT 

Glu Met Phe Glu Arg Ala Lys Lys He Ser Thr Phe Tyr 

40 45 

GAA AAG CTT GAG GAA GAA CTC TCA GAT CTA ATT GGG GAG 

Glu Lys Leu Glu Glu Glu Leu Ser Asp Leu He Gly Glu 

55 60 

ATA ACT GCA GGA ATA ACA GAG GCA CTT TAC CTG CTT GGA 

He Thr Ala Gly He Thr Glu Ala Leu Tyr Leu Leu Gly 

70 75 80 

AGG GGT CGG AAA GTA ATA ATC CCG AAG CAC ACC TAT GGG 

Arg Gly Arg Lys Val He He Pro Lys His Thr Tyr Gly 

85 90 95 



GAA TAC GAG AGG ATC TCA CGC ATG TTC GGA GGT AGG GTG ATC AAA GGT 
Glu Tyr Glu Arg He Ser Arg Met Phe Gly Gly Arg Val He Lys Gly 

100 105 HO 



CCC AAT GAC CCA GGA AAG TTA GCA GAA 
Pro Asn Asp Pro Gly Lys Leu Ala Glu 
115 120 

GTG TTC TTC TGC AAT CCA AAC AAT CCA 
Val Phe Phe Cys Asn Pro Asn Asn Pro 
130 135 

AAA GAG ATG AAA CCT CTT TTA GAT GCC 
Lys Glu Met Lys Pro Leu Leu Asp Ala 
145 150 

TTG ATC TTG GAT GAA GCC TTC ATA GAC 
Leu He Leu Asp Glu Ala Phe He Asp 

165 



TTT GTT GAA AGA AAT TCA TTC 
Phe Val Glu Arg Asn Ser Phe 

125 

GAT GGA AAG TTC TAC CGA GAA 
Asp Gly Lys Phe Tyr Arg Glu 
140 

ATT CAA GAC ACT AAC TCA ATT 
He Gin Asp Thr Asn Ser He 
155 160 

TTT GTT AAG AAA CCA GAA AGC 
Phe Val Lys Lys Pro Glu Ser 
170 175 



CCA GAG GGA GAG AAC ATA ATC AGG CTA AGG ACT TTT ACC AAA AGC TAC 
Pro Glu Gly Glu Asn He He Arg Leu Arg Thr Phe Thr Lys Ser Tyr 

180 185 190 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



GGG CTC CCA GGG GTA AGG GTT GGA TAT GTT ATT GGA TTT GTC GAT GCT 62 4 

Gly Leu Pro Gly Val Arg Val Gly Tyr Val He Gly Phe Val Asp Ala 
195 200 205 

TTC AGG AGC GTT AGA ATG CCA TGG TCA ATT GGC TCT ACT GGG GTG GCC 6 72 

Phe Arg Ser Val Arg Met Pro Trp Ser He Gly Ser Thr Gly Val Ala 
210 215 220 

TTC TTA GAG TTC TTA CTC AAA GAT AAC TTC AAA CAC TTA AGA AAA ACC 72 0 

Phe Leu Glu Phe Leu Leu Lys Asp Asn Phe Lys His Leu Arg Lys Thr 
225 230 235 240 

CTC CCC CTA ATA TGG AAA GAA AAG GAG AGG ATT GAG AAA GAA TTG AAA 76 8 

Leu Pro Leu He Trp Lys Glu Lys Glu Arg He Glu Lys Glu Leu Lys 

245 250 255 

GTT AAA AGC GAT GCA AAT TTC TTC ATT ATG AAG GTC AGA GAA GGA ATA 816 
Val Lys Ser Asp Ala Asn Phe Phe He Met Lys Val Arg Glu Gly He 

260 265 270 

ATT GAA AAG CTA AAA GAG AAT GGC ATC CTT GTA AGG GAT TGC AAG AGC 864 
He Glu Lys Leu Lys Glu Asn Gly He Leu Val Arg Asp Cys Lys Ser 
275 280 285 

TTT GGA CTC CCT GGG TAC ATA AGG TTT TCA GTT AGA AGG AGA GAA GAG 912 
Phe Gly Leu Pro Gly Tyr He Arg Phe Ser Val Arg Arg Arg Glu Glu 
290 295 300 

AAT GAC AAA CTC ATA AAC ATC CTT AGA AAA ACA CTT AAT ACT 954 
Asn Asp Lys Leu He Asn He Leu Arg Lys Thr Leu Asn Thr 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 : 

Met He Asn Gin He Asn Phe Lys Thr Ser His Gly Gly Ser Arg Glu 

15 10 15 

Glu Gly Tyr He Asn Phe Ser Ala Ser Val Asn Pro Tyr Pro Pro Glu 

20 25 30 

Trp Thr Asp Glu Met Phe Glu Arg Ala Lys Lys He Ser Thr Phe Tyr 

35 40 45 

Pro Tyr Tyr Glu Lys Leu Glu Glu Glu Leu Ser Asp Leu He Gly Glu 

50 55 60 

Pro He Thr He Thr Ala Gly He Thr Glu Ala Leu Tyr Leu Leu Gly 
65 70 75 80 

Val Trp Met Arg Gly Arg Lys Val He He Pro Lys His Thr Tyr Gly 

85 90 95 

Glu Tyr Glu Arg He Ser Arg Met Phe Gly Gly Arg Val He Lys Gly 

100 105 HO 

Pro Asn Asp Pro Gly Lys Leu Ala Glu Phe Val Glu Arg Asn Ser Phe 

115 120 125 

Val Phe Phe Cys Asn Pro Asn Asn Pro Asp Gly Lys Phe Tyr Arg Glu 

130 135 140 

Lys Glu Met Lys Pro Leu Leu Asp Ala He Gin Asp Thr Asn Ser He 



1 A C 


lie 








150 










155 










160 


Leu 


Leu 


Asp 


Glu 


Ala 


Phe 


He 


Asp 


Phe 


Val 


Lys 


Lys 


Pro 


Glu 


Ser 










165 










170 








175 




Pro 


Glu 


Gly 


Glu 


Asn 


He 


He 


Arg 


Leu 


Arg 


Thr 


Phe 


Thr 


Lys 


Ser 


Tyr 








ion 










185 










190 






Leu 


Pro 


Gly 


val 


Arg 


Val 


Gly 


Tyr 


Val 


He 


Gly 


Phe 


Val 


Asp 


Ala 


Phe 




IOC 










200 








205 






Arg 


j-fc 

Ser 


Val 


Arg 


Met 


Pro 


Trp 


Ser 


He 


Gly 


Ser 


Thr 


Gly 


Val 


Ala 




z ± U 










215 








220 








Phe 


Leu 


Glu 


Pne 


Leu 


Leu 


Lys 


Asp 


Asn 


Phe 


Lys 


His 


Leu 


Arg 


Lys 


Thr 


Z Z D 










2 3 0 










235 






240 


Leu 


pro 


Leu 


lie 


Trp 


Lys 


Glu 


Lys 


Glu 


Arg 


He 


Glu 


Lys 


Glu 


Leu 


Lys 


vai 


















250 










255 


Lys 


Ser 


Asp 


Ala 


Asn 


Phe 


Phe 


He 


Met 


Lys 


Val 


Arg 


Glu 


Gly 


He 


lie 






260 










265 








270 




Glu 


Lys 


Leu 


Lys 


Glu 


Asn 


Gly 


He 


Leu 


Val 


Arg 


Asp 


Cys 


Lys 


Ser 






275 










280 










285 




Phe 


Gly 


Leu 


Pro 


Gly 


Tyr 


He 


Arg 


Phe 


Ser 


Val 


Arg 


Arg 


Arg 


Glu 


Glu 




290 










295 










300 






Asn 


Asp 


Lys 


Leu 


He 


Asn 


He 


Leu 


Arg 


Lys 


Thr 


Leu 


Asn 


Thr 






305 










310 








315 













